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David Waxman, Fudan U. 
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Chris Wikle, U. of Missouri ($) 
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Kenneth Zaret, U. of Pennsylvania School of Medicine 
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David Bloom, Harvard U., Samuel Bowring, MIT. Angela Creager, Princeton U., 
Richard Shweder, U. of Chicago, Ed Wasserman, DuPont 
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Passion ts Just the start 


ast month, 65 Nobel laureates gathered in 
Lindau, Germany, for an annual exchange of 
knowledge between these most honored scientists 
and a group of selected young researchers from 
around the globe. To extend the reach of the 
conference, Science hosted a webinar to address a 
common concern of young scientists everywhere: 
persevering in science. Some advice on this topic may 


obstacles, periods of self-doubt, and career setbacks. But 
if a young scientist faces despair from too many failed 
academic job searches, one panelist had some tough ad- 
vice: It’s time to reassess options as to whether academia 
is the right track. The panelists encouraged thoughtful 
consideration of industry and government research labs, 
because in many cases, they offer more stable funding, 
and an impressive number of Nobel laureates came from 
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: Marcia McNutt 

have been unexpected. these environments. : . . 

Elizabeth Blackburn and a A large part of persever- Editor-in-Chief 
Jack Szostak (both 2009 ing in science is balancing a Science Journals 
laureates in Physiology or demanding scientific career 
Medicine, for the discovery with personal life. Panelists 
of how chromosomes are emphasized the importance 
protected by telomeres) and of flexibility; adjusting 
Daniel Shechtman (2011 lau- one’s scientific career to 
reate in Chemistry, for the meet the needs of a fam- 
discovery of quasi-periodic ily, for example. I certainly 
crystals) fielded questions resonated with this advice. 
submitted from more than A seagoing oceanographer 
3100 young scientists who early in my career, I decided 
registered for the discussion. to move into a different po- 
Not lost on these Nobelists sition that allowed me to 
was the discouragement stay ashore when my three 
that young scientists feel daughters reached the age 
about their career pros- when having their mother 
pects. Many find themselves home regularly really mat- 
in serial postdocs for many a F : tered. Once they all were 
years while fielding rounds lf a young scientist ‘faces done with college, I was 
of rejection letters for aca- ° on ge open to job opportunities 
demic positions. In the face desp atr...It's time to reassess for which more time away 
of this disheartening situa- options...” from home was required. 
tion, there was some tough For young scientists in 
advice. Given the current competitive climate for hiring | particularly competitive fields, they are challenged with 
and advancement, the panel agreed that young scientists doing everything they can to stand out from the crowd. 
who are not deeply passionate about their research need In this regard, I am delighted by the growing number 
to reconsider their career choices. And even if one has of events that are devoted to helping young scientists 
such extraordinary passion, it alone is not enough. The | network with very influential leaders who might help 
panelists stressed the importance of becoming a lead- advance their careers (such as the Lindau Nobel Laure- 
ing expert in something new and in demand. They also ate Meeting) and prizes that recognize the early-career 
emphasized the need to hone people skills, as science is accomplishments of young scientists (such as the Sci- 
more and more a team effort, and to become able com- LifeLab Prize and the Eppendorf Prize, both of which 
municators in order to share goals and achievements are administered in conjunction with Science). Advisors 
with potential funders and the public. Graduate students and mentors may not have the time to look out for such 
also need to be smart in selecting a lab for postdoctoral | resume-enhancing possibilities for their trainees. I en- 
training; ideal attributes are a collaborative atmosphere, courage ambitious young scientists to investigate these 
opportunities for leadership in publications, and expo- | opportunities. 
sure to more than just one senior leader in the field who Many Nobel Prize winners spoke about their “good 
will be able to write a letter of recommendation. luck,” but on closer inspection, in most cases, they had 

Certainly, the path to a successful academic career, no | created the conditions for their own good luck to happen. 
less to a Nobel Prize, is not easy. The panelists also faced - Marcia McNutt 

10.1126/science.aac9472 
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Plants in cold storage 


National Museum of Natural History curator Vicki Funk and a summer field team member collect 
samples from the U.S. Botanic Garden. 


n the race to preserve biodiversity even as species extinction rates 

soar, Smithsonian Institution scientists are starting with their 

own backyards. Plants from the U.S. Botanic Garden, the U.S. 

National Arboretum, and the Smithsonian Gardens—all located in 

Washington, D.C.—are the first targets of an initiative to capture 

the genomic diversity of half of the world’s living plant genera in 
less than 2 years. On 8 July, the Smithsonian’s National Museum of 
Natural History launched the effort, which will include a summer field 
team of students who will assist scientists with the museum’s Global 
Genome Initiative in sampling plants from the gardens’ holdings. The 
scientists will then preserve them in cryogenic vials and store them in 
liquid nitrogen. GGI, the Norway-based Svalbard Global Seed Vault, and 
the Royal Botanic Gardens’ Millennium Seed Bank in the United States 
are three of the largest ex situ conservation efforts to preserve genetic 
material outside of its natural environment. Ex situ conservation is a 
strategy described in the Intergovernmental Panel on Climate Change’s 
2014 Fifth Assessment Report as an insurance policy against climate 
change and other potential sources of biodiversity loss. 
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&4|'ve secretly been working on a lander. 99 


New Horizons principal investigator Alan Stern, at a press conference 
14 July as the spacecraft made its closest approach to Pluto. 
Stern was joking with reporters about plans to return to the dwarf planet. 


AROUND THE WORLD 
“Cures” bill clears U.S. House 


WASHINGTON, D.c. | A bill to speed 

the discovery and development of new 
medical treatments sailed through the 
U.S. House of Representatives last week. 
The legislation, known as the 21st Century 
Cures Act, calls for a temporary “innova- 
tion fund” that would give the National 
Institutes of Health an extra $8.75 billion 
over 5 years. It also aims to smooth the 
approval of high-priority treatments; for 
example, one provision allows the Food 
and Drug Administration to approve anti- 
biotics for rare life-threatening diseases 
based on smaller clinical trials. Although 
the bill enjoys the backing of hundreds 

of industry, research, and patient orga- 
nizations, it has also prompted concerns 
that the effort to speed cures might 
unintentionally compromise patients’ 
safety. The updated version of the bill will 
now head to the Senate, which has yet to 
release its own biomedical innovation bill. 
http://scim.ag/housecures 


Helium rules stifle competition 


WASHINGTON, D.c. | Newrules on selling 
off a U.S. government cache of helium— 
essential for many types of technology 

and scientific research—aren’t working as 
planned, witnesses said last week before 
the Committee on Natural Resources in the 
US. House of Representatives. The 2013 
Helium Stewardship Act aimed to establish 
a competitive market for federal helium by 
phasing in an auction instead of having the 
US. Bureau of Land Management (BLM) 
sell it for a fixed price. But the law may 

be stifling competition, the panel said, by 
giving an unfair advantage to companies 
that have storage contracts with BLM to 
receive the crude helium and refine it. 
“Nonrefiners” who buy federal helium must 
also work out a “tolling” deal to get one of 
the refiners to process it. BLM previously 
reserved 10% of helium for fixed-price 
“nonallocated” sales to nonrefiners, but at 
BLM’s first helium auction in July 2014, 
two refiners bought it all, shutting the nine 
nonrefiners out of the bidding. 
http://scim.ag/heliumrules 
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Appeals court dodges H5N1 issue 


AMSTERDAM | Influenza researcher Ron 
Fouchier of Erasmus MC in Rotterdam, 

the Netherlands, received a disappointing 
verdict in his legal battle over so-called gain- 
of-function research, in which flu strains 

are made more transmissible in the lab. 

In 2012, the Dutch government demanded 
that Fouchier seek an export license before 
submitting a highly controversial H5N1 

flu study to Science. Fouchier obtained the 
license under protest, and Erasmus MC 
sued the government for infringement of 
academic freedom—a claim a district court 
rejected in 2013 (Science, 8 November 2013, 
p. 676). The case went to Amsterdam’s Court 
of Appeal, which annulled the lower court’s 
decision last week, Fouchier says, because 
Erasmus MC had no standing to sue once it 
had obtained the license. The appeals court 
did not rule on the key issue of whether 
Fouchier’s research is subject to export con- 
trol. The verdict was not yet public as Science 
went to press. http://scim.ag/FouchierH5N1 


Deal struck on Iran nukes 


VIENNA | Iran this week agreed to disman- 
tle large pieces of its nuclear program in 
exchange for relief from crippling economic 
sanctions—with science as an important 
beneficiary. The agreement, 2 years in 

the making, would slow Iran’s “breakout 
time’—the time needed to produce enough 
weapons-grade fissile material for one 
bomb—from an estimated 2 to 3 months to 
at least a year. Iran will convert its sensitive 
Fordow uranium enrichment facility into 
an international “nuclear, physics, and 
technology centre,” with Russia helping to 
reconfigure two centrifuge cascades to pro- 
duce stable isotopes for industry. Fordow 
will also host small linear accelerators for 
basic research in nuclear physics and astro- 
physics. Proposals for collaborative projects 
at Fordow will be reviewed at an interna- 
tional workshop. The deal also calls for the 
possibility of having Iran participate in the 
International Thermonuclear Experimental 
Reactor, or ITER, under construction in 
France. http://scim.ag/lranFordow 


NEWSMAKERS 
Plant biologist altered images 


Two investigations found many instances 

of image manipulation in papers by Olivier 
Voinnet, a plant biologist at CNRS, the 
French national research agency, who is cur- 
rently working at the Swiss Federal Institute 
of Technology Zurich (ETH). Voinnet’s 

work came under attack in December on 
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cientists keep track of the population density of leopards in the wild using camera 

traps; they snap pictures of the cats and identify individuals by their character- 

istic spot patterns. But, due to a recessive gene that causes melanism, most of 

the leopards prowling the Malay Peninsula are black—even their spots—making 

it nearly impossible to distinguish them using the cameras. Now, there's a work- 
around, scientists report in the Journal of Wildlife Management. The spots are visible 
at infrared wavelengths, so by modifying infrared flash camera traps on the peninsula 
so that they were forced into night mode throughout the day, the researchers were ulti- 
mately able to distinguish distinct spot patterns and identify 94% of the animals along 
a wildlife corridor in Malaysia—crucial for keeping tabs on the region's population over 
time. The work not only represents the first leopard density estimate in Malaysia, the 
authors note, but also the first successful attempt to estimate population size using 
melanistic phenotypes—such as a black leopard’s dark spots. 


a website called PubPeer; a CNRS inves- 
tigation that won’t be made public found 
manipulations that constituted scientific 
misconduct and “tarnished” CNRS’s reputa- 
tion, the agency said on 10 July. CNRS 
suspended Voinnet for 2 years. On the 
same day, ETH announced that a panel had 
found problems in 20 of Voinnet’s papers, 
but that those produced while he was at 
ETH did not constitute misconduct under 
the institute’s definition. Voinnet received 
an “admonition” but will keep his position 
at ETH. In a statement, he apologized and 
took “complete responsibility.” 
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FINDINGS 
Cholera vaccine shows promise 


An inexpensive cholera vaccine may help 
prevent the deadly disease in the more 
than 50 countries, mostly in Asia and 
Africa, where it is endemic, according to 
a new study in The Lancet. Cholera causes 
severe diarrhea and is estimated to kill 
about 95,000 people—a large number of 
whom are children—every year. In the 
first-of-its-kind study, scientists tested 
the vaccine, called Shanchol and made in 
India, in 270,000 people in the slums of 
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Dhaka, where the threat of contracting the 
infection is nearly constant. They found 
that the vaccine provided individuals more 
than 50% protection against cholera and 
lowered life-threatening episodes of the 
infection by about 40%. When given as 
part of routine health services, the vaccine, 
which consists of a two-dose regimen that 
currently costs $3.70, could drive down the 
toll considerably, the researchers say. 


A flightless, winged dino 


Researchers have found the largest ever 
dinosaur with full-fledged wings and 
feathers, they report online this week in 
Scientific Reports. The 125 million-year- 
old dino, called Zhenyuanlong suni and 
found in China, was about 1.65 meters 
long, a little longer than a modern condor. 
It belonged to a group of dinos called 
dromaeosaurs, which includes Velociraptor 
from Jurassic Park, and which was 

closely related to early birds. But although 
Zhenyuanlong’s wings had multiple layers 
of birdlike feathers, they were very short 
compared with those of most other winged 


BY THE NUMBERS 


15 


million 
Number of people receiving 
antiretrovirals for HIV in March 2015, 


according to anew UNAIDS report 
released this week. 


834,000 


Number of rabbits, nonhuman pri- 
mates, and other regulated animals 
used in U.S. biomedical research last 
year—a drop of 6% from the previous 
year, and the lowest number since 
data collection began in 1972. 


18.7% 


Increase in global wildfire activity 
over the last 35 years, according to a 
study in Nature Climate Change. 
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The short-armed, winged, and feathered dinosaur Zhenyuanlong suni. 


dinos, and thus it was probably unable 

to fly. That leaves the discovery team 
wondering what the wings were for. One 
possibility is that it evolved from ancestors 
that could fly, but used its wings mainly for 
sexual display. http://scim.ag/flightlessdino 


Enter the pentaquark 


GENEVA, SWITZERLAND | A long search 
paid off this week, when researchers 
at the European high-energy physics 


aa 


Scientists (top) display a wafer made of 7-nanometer chips (inset). 


laboratory CERN announced 
that they had sighted strong 
evidence of particles made 
up of five quarks. Predicted 
by physicist Murray Gell- 
Mann in 1964 to exist 
alongside protons and 
neutrons (which contain 
three quarks) and mesons 
(which are quark-antiquark 
pairs), pentaquarks would 
consist of four quarks and an 
antiquark. Scientists thought 
they had discovered a light 
pentaquark known as theta- 
plus in 2003, but those claims evaporated 
under further scrutiny (Science, 22 April 
2005, p. 478). Physicists at the Large 
Hadron Collider’s LHCb experiment 
instead snared a heavier “charmonium” 
pentaquark by sifting through thousands 
of decays of three-quark lambda-b par- 
ticles. The finding, which still has to be 
confirmed by other groups, suggests the 
existence of a variety of pentaquarks 
with different masses, the scientists say. 
http://scim.ag/pentaquark 


pre 


IBM back on track with Moore’s law 


BM announced last week that it has manufactured working versions of a new, ultra- 
dense computer chip that will reduce the area needed for a given amount of circuitry 
by half and has four times the capacity of the most powerful chips on the market. 
Currently, commercial chip technology is at the “l4-nanometer manufacturing” stage, 
referring to the size of the chip's smallest features. The new IBM chips have 7-nano- 
meter transistors, making it possible to pack more into the same space and increasing 


the capacity. The new chips were manufactured by using silicon-germanium in some 
regions rather than pure silicon, which the com- 


pany says reduces power requirements 

and makes faster transistor switching 
possible. But IBM hasn't said when the new 
chips are likely to be commercially available. 
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007-pratest denouncing 
faterboarding as torture; 

— 
> several psychologists 
ided the method’s used 
on U Siimilitatydetainees. 


Torture report prompts APA apology 


Admitting it colluded with U.S., psychologists group to change policies, leadership 


By John Bohannon 


fter years of denying that it had given 

scientific and ethical legitimacy to 

torture by the U.S. government, the 

American Psychological Association 

(APA) last week accepted the find- 

ing of an external investigation 
that concluded it had done just that. Now, 
with a public apology and sudden wave 
of high-level resignations or retirements, 
APA is struggling to craft an institutional 
response that will satisfy its members and 
long-time detractors, even as some of those 
pilloried in the probe defend themselves 
and their colleagues. 

“This is a crisis,’ says Nadine Kaslow, 
a psychologist at Emory University in At- 
lanta and a former APA president, who 
helped launch the investigation. “I regret 
that the organization didn’t listen to the 
critics earlier.” 

The 542-page report from a former Chi- 
cago inspector general, David Hoffman, 
pulls no punches, concluding that APA 
officials colluded with the U.S. govern- 
ment to enable the torture of detainees. 
APA’s Board of Directors quickly released 
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a response, promising among other things 
to recommend a new policy prohibiting 
psychologists from participating in interro- 
gation of persons held in custody by mili- 
tary and intelligence authorities. APA then 
announced the departure of most of its 
staff leadership: CEO Norman Anderson, 
Deputy CEO Michael Honaker, public rela- 
tions director Rhea Farberman, and ethics 
director Stephen Behnke. 

“All of these people were in the know,’ 
says former APA President Gerald Koocher, 
a psychologist at DePaul University in Chi- 
cago, Illinois, and the editor of the jour- 
nal Ethics & Behavior. Koocher himself 
is under fire, as two of APA’s staunchest 
critics have called for the body to censure 
him as well, after receiving a confidential 
briefing on the Hoffman report earlier this 
summer. 

APA had commissioned the report last 
year after the publication of Pay Any 
Price: Greed, Power and Endless War, a 
book by New York Times reporter James 
Risen that accused the organization of 
providing cover for torture. The report’s 
most damning findings concern a 2005 
APA committee called the Task Force on 
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Psychological Ethics and National Secu- 
rity (PENS). The task force was created 
in the midst of revelations that detainees 
were subjected to “enhanced interroga- 
tion” at U.S. government facilities in Iraq, 
Afghanistan, and Guantanamo Bay, Cuba, 
and that psychologists were intimately in- 
volved in both the design and practice of 
these efforts. 

As Hoffman discovered through inter- 
views, medics within the intelligence com- 
munity were “not on board” with such 
interrogations. To quell this internal re- 
sistance, the government hoped to enlist 
support from APA, psychology’s largest pro- 
fessional organization. And the PENS task 
force provided it, concluding in a 2005 state- 
ment that it was ethical for psychologists to 
take part in the interrogation program. 

The PENS decision sparked protests by 
many APA members, some of whom called 
for withholding dues, but Hoffman found 
that they were ignored. “Being involved 
in the intentional harming of detainees ... 
could do lasting damage to the integrity 
and reputation of psychology, a profession 
that purports to ‘do no harm,” he writes, 
but “these countervailing concerns were 
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simply not considered or were highly sub- 
ordinated to APA’s strategic goals.” Accord- 
ing to Hoffman, APA sought to maintain its 
privileged relationship with the Pentagon, 
a massive employer of psychologists. 

Hoffman’s analysis of internal APA 
emails found that the members of the 
PENS task force were carefully chosen in 
a collaboration involving officials at APA, 
the Pentagon, and the Central Intelligence 
Agency, and its conclusions were vetted in 
advance by insiders at both agencies. The 
goal of PENS, Hoffman offers, was not to 
examine the ethics of torture but to “curry 
favor” with the U.S. Defense Department. 

Hoffman’s characterization of PENS is 
unfair, according to Koocher, who was 
one of the architects of the task force. “We 
solicited widely and openly for member- 
ship,” he says. The fact that so many task 
force members came from the military is 
not evidence of collusion but good judg- 
ment. “If you're focusing on interroga- 
tion in a military context then those are 
the people with the relevant expertise.” As 
for the allegation of currying favor with 
the Pentagon, Koocher is adamant that it 
was not his goal. “No way were we cover- 
ing up for [Vice President] Cheney or [De- 
fense Secretary] Rumsfeld, both of whom I 
cannot stand.” 

Koocher says that he was unaware that 
the torture was ongoing. He points out that 
he, along with other representatives of U.S. 
medical associations, visited the detention 
center at Guantanamo in 2006. “I asked 
hard questions,” he says. When it was later 
revealed that torture continued at the facil- 
ity, “I was extremely upset.” But by then, he 
says, “I was no longer an APA official. What 
was I supposed to do?” 

That sentiment may not save Koocher 
from sanctions. He is on a list of APA mem- 
bers to be banned from APA governance 
“effective immediately”—just one of several 
recommendations from Steven Reisner of 
New York University and Stephen Soldz 
of the Boston Graduate School of Psycho- 
analysis, who also urged that APA’s top ex- 
ecutive, legal, and public relations staff be 
fired. Reisner and Soldz, persistent critics 
of APA’s role in the interrogation program, 
were invited by APA to review the Hoffman 
report in advance and give the society their 
feedback. APA wouldn’t comment specifi- 
cally on the pair’s recommendations; sev- 
eral people on their “staff to be fired” list 
remain with APA. “A lot of change can hap- 
pen, but it will take a lot of time to imple- 
ment it,’ Kaslow says. 

APA’s 180° turn is only a start, Soldz says. 
“The APA and the entire psychology profes- 
sion needs to grapple with the enormous 
scandal enveloping psychological ethics.” m 
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Researchers seek clear reasons 
when clinical trials end early 


Explanations for abandoning tests of new treatments 
earlier than planned are often hazy 


By Jennifer Couzin-Frankel 


ike a marathon with far more run- 

ners limbering up at the start than 

stumbling across the finish, the race 

to bring a new treatment to market 

has dropouts along the way. About 

12% of clinical trials are reported 
to shut down prematurely. Knowing why 
could help minimize the number of termi- 
nated trials going forward. 

But a paper published earlier this month 
by a group of computational biologists 
suggests that this knowledge isn’t easy 
to come by. The main reason: Companies 
can type whatever they want into the 
tiny space—a mere 160 characters—allot- 
ted on ClinicalTrials.gov, a registry main- 
tained by the National 
Library of Medicine. 
In their analysis of all 
3122 terminated trials 
on the registry at the 
time their study began, 
“it just seemed like a 
complete mess,” says 
Frederick Roth, a systems biologist and 
geneticist at the University of Toronto in 
Canada. Reasons were often murky, rang- 
ing from “it was decided to not proceed 
with the study at this time” to “SARS epi- 
demic in Asia and Canada.” 

While at Harvard University, Roth and 
two undergraduates came to their project 
quite by accident. They were interested in 
adverse interactions between drugs or be- 
tween drugs and genes. They decided to do 
what he calls an “amateur pass” through 
ClinicalTrials.gov. 

Their search didn’t yield much. The rea- 
sons for termination were so diverse and 
often so vague that Roth and his students 
decided to launch a new project: learn 
more about why clinical trials end early by 
dividing the information given in the short 
blurbs into “buckets,” such as funding, eth- 
ical reasons, or business decisions, so they 
could see the breakdown by category. They 
found that by far the most popular reason 
was insufficient enrollment, accounting for 
about one-third of terminated trials. About 
11% failed to establish efficacy. In all, Roth 
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“Tt just seemed like a 
complete mess.” 


Frederick Roth, University of Toronto 


and his students identified 35 categories, 
among them “lost interest,’ “inadequate 
design,” and “key staff left.” (In one case, 
an orthopedic surgeon performing a trial’s 
knee surgeries moved elsewhere.) “It’s a 
potpourri of reasons why you terminate,” 
agrees Deborah Zarin, the director of 
ClinicalTrials.gov. 

In the new paper, published online 
at bioRxiv.org, Roth and first author 
Theodore Pak, now an M.D./Ph.D. student 
at the Icahn School of Medicine at Mount 
Sinai in New York City, recommend that 
ClinicalTrials.gov aim for greater transpar- 
ency by asking sponsors to answer several 
questions when their trials end early. Those 
include whether it even started, whether 
data were ever examined, and whether in- 
terim examinations of 
efficacy or safety played 
a role. 

Zarin welcomes Roth’s 
dive into the myriad ex- 
planations, but she is 
cautious about the ques- 
tions the authors recom- 
mend for trial sponsors. Safety and efficacy 
can’t always be easily separated from each 
other, she says. Zarin is especially interested 
in distinguishing between trials that end be- 
cause of the science versus some other rea- 
son: In May, she and her colleagues reported 
in PLOS ONE that 68% of 905 terminated 
trials with results listed on ClinicalTrials.gov 
stopped for reasons other than data accumu- 
lated in the study. Only one-fifth ended early 
because of safety or efficacy concerns from 
trial data, and the rest didn’t give a reason. 

Another unsettling issue is that many 
terminated studies aren’t listed as such 
in ClinicalTrials.gov and other databases 
because the entries are not up to date, ac- 
cording to ongoing research by clinical 
epidemiologist Matthias Briel at the Uni- 
versity Hospital of Basel in Switzerland. 
That, combined with sometimes squishy 
reasons offered for trial termination, sug- 
gests to him that one implication of Roth’s 
paper is that “people might then take infor- 
mation from these registries and introduce 
inaccuracies at least, or even bias into their 
own studies, without realizing it.” B 
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The World Health Assembly rejected a proposal to increase membership fees in May. 


Report prescribes strong medicine for WHO 


Ebola failures show that difficult reforms are needed, an independent panel says 


By Kai Kupferschmidt 


he World Health Organization (WHO) 

is still struggling to end the Ebola out- 

break in West Africa. But in the wake 

of what many viewed as a sluggish re- 

sponse to the crisis, another battle is 

brewing—about the future role of the 
organization itself. 

On 7 July, an independent panel delivered 
a scathing review of WHO’s performance and 
proposed wide-ranging reforms that would 
enable the agency to better tackle the next 
major health crisis—from giving it more 
money and power to setting up a special, 
semi-independent emergency center. WHO 
“does not currently possess the capacity or 
organizational culture to deliver a full emer- 
gency public health response,’ concluded 
the group, led by Dame Barbara Stocking, 
the former chief executive of Oxfam in the 
United Kingdom; it recommended that the 
agency be made “fit for purpose.” 

But although many of the suggestions 
have been praised as sensible, WHO’s com- 
plex, politicized governance structure and 
entrenched bureaucracy make the $2 billion 
U.N. agency difficult to change. Just 2 months 
ago, for instance, the World Health Assembly 
(WHA)—the annual meeting of health minis- 
ters that is WHO’s decision-making body—re- 
jected a proposal by WHO Director-General 
Margaret Chan to increase member states’ 
contribution fees by 5%, as the report rec- 
ommends. Several other suggestions in the 
report were previously made by a panel that 
reviewed the H1N1 pandemic of 2009, but 
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were never implemented. 

The new report, too, “might just die on 
paper,’ says Joanne Liu, president of Doctors 
Without Borders (MSF), which has played 
a major role in the Ebola epidemic. “Every- 
thing at this stage is in the hands of the mem- 
ber states and how willing they are to give a 
second chance to WHO.” 

The six-member panel, appointed by 
WHO’s Executive Board in March, inter- 
viewed WHO sources and outside experts, 
met with representatives of relief organiza- 
tions, and flew to the affected countries. It 
explored why WHO waited until 8 August 
2014, when Ebola had already infected more 
than 1000 people, to declare the outbreak a 
Public Health Emergency of International 
Concern, a status that triggers international 
action. Early warnings about the outbreak’s 
scale within WHO were ignored, the panel 
found, partly because “WHO does not have 
an organizational culture that supports open 
and critical dialogue between senior leaders 
and staff or that permits risk-taking.” 

In addition, local community leaders— 
whose collaboration would prove crucial to 
controlling Ebola—were engaged too late, 
deployments of staff and consultants in the 
field were too short, and communication 
with the media was ineffective. 

The report’s most far-reaching proposal 
is that WHO set up a new Centre for Health 
Emergency Preparedness and Response that 
would bring together WHO experts in out- 
break control and those providing humani- 
tarian assistance, two areas that are now 
separate. The center should be overseen by an 
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independent board and headed by “a strong 
leader and strategic thinker, the report 
notes. It’s a good proposal, says Norwegian 
epidemiologist Preben Aavitsland, the head 
of a researching and consulting company 
in Kristiansand, Norway. But WHO officials 
may resist the idea, because a strong emer- 
gency center could become a state within a 
state, he cautions. “It will be interesting to 
see whether that is followed up.” 

Others say WHO does not need an internal 
capacity to mount a full-on response to every 
outbreak. Rather than expanding, the organi- 
zation should depend more on external part- 
ners like the U.S. Centers for Disease Control 
and Prevention—which has three times 
WHO’s annual budget—or Public Health 
England, says David Heymann, a former 
WHO assistant director-general who is now 
at the London School of Hygiene & Tropical 
Medicine. “To be sustainable, the workforce 
should be outside of WHO,” he says. “I would 
be totally against WHO increasing its staff to 
sit around and wait for outbreaks to occur.” 

The report also stresses the need to boost 
countries’ compliance with the International 
Health Regulations (IHR), which lay out the 
responsibilities of WHO and member states 
during outbreaks. Completely revised and 
modernized after the SARS outbreak of 2003, 
the current IHR require member states to 
have “core public health capacities’—such 
as laboratories and trained staff—to detect 
and fight infectious diseases. So far, only 
64 of 193 member states have confirmed that 
they comply with these rules; WHO should 
develop a plan to bring all countries up to 
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speed, the report says. 

That is easier said than done in dirt-poor 
countries where even basic medical services 
are lacking, Liu says. “It’s this long, huge 
shopping list of things that you are sup- 
posed to implement, with no prioritization,” 
she says. WHO should narrow the require- 
ments down to the essentials, Liu argues. 
Aavitsland adds that developed countries 
should help foot the bill. 

In addition, the report says, WHO should 
penalize countries that violate an IHR pro- 
vision banning member states from issu- 
ing travel restrictions if WHO doesn’t deem 
them warranted. More than 40 countries 
imposed unnecessary restrictions during the 
Ebola outbreak, which hurt the economies in 
Guinea, Sierra Leone, and Liberia, and made 
it harder to send people and supplies. The re- 
port recommends giving WHO the power to 
sanction such behavior and to take cases to 
the U.N. Security Council. But that requires 
countries to surrender some national powers, 
something politicians are notoriously reluc- 
tant to do. “The current IHR were the result 
of a political negotiation that took much 
time,’ Heymann says. “In order to change 
them, there have to be new negotiations.” 

A spokesperson for WHO says “several 
of the recommendations in the report have 
been addressed already or are being ad- 
dressed.” For instance, the organization 
is looking seriously at establishing a new 
emergency center, he says. Whether any real 
change is possible will become clear at the 
next WHA in May 2016, Aavitsland says. The 
issue of WHO’s budget will surely come up 
again then as well. “Member states keep in- 
creasing their expectations of what WHO will 
deliver, in many areas, without giving the or- 
ganization the means to fulfill these expecta- 
tions,’ he says. “I think member states will 
get what they pay for.” 

In 2017, Chan, who has been criticized for 
being too conflict-averse, is stepping down; 
a new, more forceful personality at the helm 
could make a huge difference, says Ilona 
Kickbusch, an independent global health 
consultant based in Brienz, Switzerland, 
and one of the authors of the new report. A 
group looking at proposals to reform WHO’s 
governance has discussed limiting directors- 
general to one 7-year term instead of two 
5-year terms, she says. Free of the need to be 
reelected, WHO leaders might find it easier 
to challenge member states. 

Liu is hopeful that a stronger WHO will 
emerge from the soul-searching. “The reality 
is today, there is only one organization that 
has the legitimacy and the authority to call 
a Public Health Emergency of International 
Concern, and that is not MSF, or the World 
Bank. It is WHO,” she says. “So they need to 
have the capacity to do that.’ & 
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SCIENCE POLICY 


Russia targets Western ties 


Crackdown on “foreign agents” and “undesirable” groups 
threatens private support for science 


By Vladimir Pokrovsky 


he news last week that the Dynasty 

Foundation, Russia’s only private 

funder of scientific research, is clos- 

ing its doors adds to a darkening 

prospect for philanthropic support of 

Russian science. The decision by the 
Dynasty Foundation’s council, announced 
8 July in a terse one-sentence notice on 
Dynasty’s Russian-language website, came 
weeks after the Russian government had 
labeled Dynasty a “foreign agent” under a 
recently enacted law. 


Russian parliamentarians Valentina Matviyenko 
and Konstantin Kosachev pushed for the new laws. 


That move was part of what many see as a 
growing official crackdown on organizations 
the government considers subversive. A new 
mechanism, separate from the foreign-agent 
law, threatens to label such groups as “un- 
desirable’ and make collaborating with 
them illegal, potentially curtailing their sup- 
port for scientists. “These two stories are 
very symptomatic and replicate each other,’ 
says Mikhail Gelfand of the Russian Acad- 
emy of Sciences’ (RAS’) Institute for Infor- 
mation Transmission Problems in Moscow, 
one of Russia’s top biologists. 

In May, the Russian Ministry of Justice 
added Dynasty to its list of foreign agents—a 
new designation for organizations that re- 
ceive funding from the West (Science, 5 June, 
p. 1067)—on the grounds that Dynasty’s 
founder, telecommunications mogul Dmitry 
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Zimin, funds the foundation from personal 
accounts held in Western banks. Zimin was 
infuriated and left Russia in early June. 
Dynasty’s governing council decided to try to 
keep operating without personal donations 
from Zimin, but it declared the foreign agent 
label “absolutely unacceptable.” The final 
straw came in mid-June, when a court fined 
Dynasty 300,000 rubles (about $5500) for 
refusing to register as a foreign agent. 

Anna Piotrovskaya, the foundation’s ex- 
ecutive director, told the press that she can- 
not say exactly when it will close but that all 
of Dynasty’s obligations to current grantees 


will be fulfilled. Vladimir Putin’s spokesman 
Dmitry Peskov said the Kremlin regrets that 
the foundation will be liquidated. But he 
said no one forced the closure. 

Some Russian scientists, however, called 
the move outrageous. Zimin’s public humili- 
ation was an “especially dishonorable ac- 
tion,” says Evgeny Onishchenko, a researcher 
at RAS’ Lebedev Physical Institute in Mos- 
cow. “He was unselfishly helping to develop 
and popularize science in Russia, but the au- 
thorities have ... publicly made a spy of him.” 

As Dynasty’s board deliberated, Russian 
authorities were enacting even more draco- 
nian measures. A new law, which was passed 
in May and took effect in June, authorizes 
the Russian prosecutor general to label “un- 
desirable” any group deemed to threaten the 
foundations of constitutional order, national 
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defense, or national security. Organizations 
so designated must cease activities in Russia 
and will have their bank accounts blocked. 
Groups or individuals who cooperate with 
them, even outside Russia, face fines and 
could be sentenced to as many as 6 years in 
prison for a second offense. 

On 8 July, the Federation Council—the 
upper chamber of Russia’s parliament— 
published a suggested list of 12 undesirable 
organizations for the prosecutor general to 
investigate. The list includes the MacArthur 
Foundation, which has run grant programs 
in Russia, and the Open Society Founda- 
tions of Hungarian-born philanthropist 
George Soros. Parliamentarians say many 
more names will follow. Press reports have 
suggested that they could include the Ford 
Foundation, the International Research & 
Exchanges Board (which awards grants for 
postgraduate study in the United States), 
and other donors. 

Seeing Soros’s name on the list was be- 
wildering, researchers say, because one of 
his charities—the International Science 
Foundation (ISF)—helped ensure the sur- 
vival of Russian science in the mid-1990s. 
During the economic turmoil that followed 
the collapse of the Soviet Union, ISF spent 
more than $100 million on assistance that 
included 35,000 emergency research grants 
in Russia and funding to build 32 university 
computer centers in Russia’s major cities. 
Soros’s foundations haven’t been active in 
Russia since 2003 and have shown no sign 
of plans to return. 

“Soros has done very much” for Russia, 
says Pavel Arseniev, who directed ISF’s Rus- 
sian office in the mid-1990s. Arseniev notes 
that the government of former Prime Min- 
ister Viktor Chernomyrdin had partnered 
with ISF in funding travel grants. Gelfand 
says the Russian scientific community is 
partly to blame. “They did not react to ac- 
cusations that Soros allegedly was buying 
[Russian] secrets.” 

As for Dynasty, Arseniev faults its law- 
yers for giving the government an opening. 
“Tf you work in an aggressive environment, 
you cannot ignore such details,” he says. 
“Zimin, though he never positioned himself 
as a political figure, has always behaved very 
independently, which undoubtedly annoyed 
the conservative part of the establishment. 
Once they got a possibility, they took him 
off the stage.” 

Onishchenko says the attack on Dynasty 
and the black list are troublingly reminis- 
cent of the Soviet era. “As in the former 
gloomy time, they create the image of an en- 
emy who is responsible for the majority of 
Russia’s problems,” he says. “And those who 
helped Russian science get persona non 
grata status.” & 
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BEHIND THE NUMBERS 


Salaries pump up biomedical inflation 


By Jeffrey Mervis 


According to the conventional wisdom, the 
cost of the things needed to do biomedical 
research in the United States rises faster 
than the price tags on all consumer goods 
and services. For 3 decades, an index cre- 
ated by the National Institutes of Health 
(NIH) has documented that disparity, giv- 
ing lobbyists more ammunition to plead 
with lawmakers that NIH’s annual budget 
hikes should exceed the overall rate of in- 
flation in the country. 

The NIH index, which captures the cost 
of such things as reagents, test animals, 
and scientific equipment, has at times 
outpaced that broader index by as many 
as three percentage points. But in 2012, 
a strange thing happened that challenges 
the conventional wisdom: 


example of NIH’s chronic boom-and-bust 
funding cycles. A 2-year, $10 billion budget 
spike, part of a massive stimulus package 
to help the U.S. economy recover from the 
2008 global financial meltdown, was end- 
ing, and money was tight. 

The decision to cap salaries sent the 
BRDPI into a tailspin, bringing it below the 
already low GDP PI. In 2008 the biomedical 
index had hit a 20-year peak of 4.7%, more 
than twice the 2.1% rate for the GDP PI. By 
2010 it had fallen somewhat, to 3.0%, but it 
still far outpaced the minuscule 0.9% rate 
clocked by the GDP PI. Then it plummeted 
in 2012, whereas the GDP PI rose to 1.9%. 

Outsiders might have thought the drop 
would be good news. After all, if biomedi- 

cal inflation slows, then NIH 


The Biomedical Research 
and Development Price 
Index (BRDPI) fell below 
the Gross Domestic Prod- 
uct Price Index (GDP PI), a 
variation of the more famil- 
iar Consumer Price Index. 
The biomedical index’s 1.3% 
growth rate that year not 
only trailed the GDP PI’s 
1.9% but also was the low- 
est in the BRDPI’s history. 


percent 


Share of the biomedical 
research price index taken 
up by salaries 
and fringe benefits. 


should be able to stretch its 
limited dollars further. 

But top NIH officials did 
not trumpet the slowdown. 
The former head of NIH’s 
extramural program, Sally 
Rockey, was in the habit of 
blogging each year about 
the value of the index. She 
has called it an “important 
[way] ... to measure changes 
to the purchasing power of 


(The news, which NIH re- 

ported in January 2014 after the final 2012 
numbers had been crunched, went largely 
unnoticed at the time.) 

To find out why that year was such an 
anomaly, one needs to know what goes into 
the BRDPI. NIH told Science that the infor- 
mation wasn’t publicly available, but that it 
could be obtained under the federal Free- 
dom of Information Act (FOIA). We did so, 
and learned that the index is dominated not 
by the cost of equipment and supplies, but 
rather by the salaries and benefits paid from 
a grant. In fact, overall personnel costs typi- 
cally account for two-thirds of the change in 
the index from one year to the next. 

The outsized effect of salaries and ben- 
efits on biomedical inflation became clear 
after Congress passed a spending bill in 
December 2011 that lowered the salary 
ceiling for investigators on a standard NIH 
grant from $199,700 to $179,700. Legisla- 
tors wanted to free up money for more 
research grants at a time when scientists 
complained that NIH’s $30 billion bud- 
get couldn’t support enough of their good 
ideas. The 2011 bill was the most recent 
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the NIH dollar and make 
projections for future fiscal years.” But in 
a 28 March 2014 posting, she simply noted 
that the 2012 dip “was primarily driven by 
the reduction in the salary cap for princi- 
pal investigators.” 

Nor does a departmental memo an- 
nouncing the record-low 2012 BRDPI 
number include any formal reaction to the 
surprising slowdown. The same memo does 
include a preliminary estimate for the 2013 
BRDPI that shows it exceeding the broader 
index, and the authors appear relieved at 
the prospect of returning to the status quo. 
The 2013 BRDPI figure, they note, although 
“still low by historical standards ... is at 
least once again growing at a higher rate 
than the [GDP] price index.” 

NIH enjoys strong support in Congress, 
and the realization that biomedical infla- 
tion largely tracks salary trends, not the 
sticker price of essential lab equipment and 
supplies, is unlikely to have a major impact 
on policy debates. Still, it may behoove 
biomedical lobbyists to think twice before 
citing the cost of high-tech science as a ra- 
tionale for pumping up NIH’s budget. & 
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MEANS TO AN END 


Cities, states, and provinces are gearing up to halt their AIDS 
epidemics—though the definition of success varies 


fter what Shane Ryan calls “a silly, 
stupid weekend” in late April, he 
joined the many other men in San 
Francisco, California, who line up 
six mornings each week and wait 
for the doors to open at Magnet. 
Located in the city’s Castro dis- 
trict, Magnet provides sexual 
health services for gay and bisex- 
ual men. Last year, the center diagnosed 37% 
of the new HIV infections in the city. Ryan, 
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By Jon Cohen 


like most of the other men in line, came to 
find out whether his condomless sex had the 
consequence he feared. “Usually, ’m very 
safe, always precautious, but I went a little 
sidetracked that weekend,” Ryan said. 

Ryan, who is 24 years old and grew up in 
Ireland, met with a nurse practitioner, who 
explained that he could immediately start 
taking antiretrovirals (ARVs). This so-called 
postexposure prophylaxis (PEP) might abort 
an infection if indeed he had been exposed to 
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HIV, the nurse explained. Ryan began taking 
a pill each day that contains four ARVs. 

In June, when Ryan’s follow-up test came 
back negative, another Magnet nurse prac- 
titioner, Pierre-Cédric Crouch, asked if he 
wanted to start on pre-exposure prophylaxis, 
or PrEP. “Having gone on PEP is an indication 
that PrEP might be a good thing for you to 
do,” Crouch said. He explained that the daily 
PrEP pill contains only two of the four drugs 
used in PEP and has far fewer side effects. It 
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After this test showed Shane Ryan’s blood was 
negative for the AIDS virus, Magnet’s nurse 
practitioner Pierre-Cédric Crouch (background) 
offered him anti-HIV drugs to prevent infection. 


is not a substitute for using condoms, he said, 
but it provides “that extra layer of protec- 
tion.” Ryan opted to try it and was directed 
to a PrEP benefits manager who would help 
him get the drug at low or no cost. 

Magnet’s aggressive attempt to prevent 
the spread of HIV to people at high risk is 
part of a groundbreaking citywide initiative 
launched this January that aims to make San 
Francisco the first jurisdiction in the world to 
get transmission down to such low levels that 
it effectively will “end” its epidemic. Specifi- 
cally, San Francisco wants to achieve a 90% 
drop in new infections and deaths by 2020. 
“T think we’re going to get to the point where 
HIV diagnoses are very rare,’ says Steve 
Gibson, who heads Magnet. 

New York state and the Canadian prov- 
ince of British Columbia (BC) are also at the 
vanguard of the ending AIDS movement. 
These three locales have different defini- 
tions of what it means to end AIDS, as well 
as different approaches, tailor-made to their 
distinct demographics and politics. All, 
however, rely on aggressive use of ARVs and 
intensive efforts to identify newly infected 
people and their partners. Those strategies 
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reflect several game-changing findings of 
the past 5 years. 

Researchers now know that if infected 
people take their ARVS and knock down 
the virus to undetectable levels in the blood, 
they rarely transmit it to their sexual part- 
ners. The approach, called treatment as 
prevention, pays special dividends with 
recently infected people, who account for a 
disproportionate amount of spread. Studies 
have also shown that uninfected people who 
take the drugs pre-exposure—PrEP—can 
greatly reduce their own risk of infection. 

Buoyed by these remarkable advances 
against a virus that to date has infected 
76 million people and shortened the lives of 
half of them, the Joint United Nations Pro- 
gramme on HIV/AIDS (UNAIDS) in 2014 set 
an “ambitious treatment target” of 90-90-90 
by 2020: 90% of infected people worldwide 
will know their HIV status, 90% of those will 
receive ARVs without interruption, and 90% 
of those will have no detectable virus in their 
blood. Doing the math, that means 73% of 
the total infected population will have fully 
suppressed viral levels. This treatment-as- 
prevention strategy, even without PrEP, will 
“end the AIDS epidemic as a major global 
health threat by 2030,” UNAIDS predicts. 

A just-released report, “Defeating AIDS— 
advancing global health,’ in the 25 June on- 
line issue of The Lancet, is less optimistic. 
“There is an urgent need to do more and 
to do better now,” declares the report, by a 
commission including heads of state, pub- 
lic health leaders, and even actress Charlize 
Theron. It notes that too many locales have 
sluggish responses and little hope of down- 
grading their epidemics to “low-level ende- 
micity” (see sidebar, p. 230). But BC, New 
York, and San Francisco aim to pave the way 
in the ending AIDS movement, which will be 
a central topic at an international HIV/AIDS 
meeting in Vancouver 19 to 22 June. 


JULIO MONTANER, CO-CHAIR of the Van- 
couver meeting and the key architect of 
90-90-90, took a gamble when he and his 
colleagues published an editorial in the 
5 August 2006 issue of The Lancet that ar- 
gued for making ARVs the backbone of pre- 
vention efforts. “We were told by UNAIDS 
that we were medicalizing prevention, and 
that this was irresponsible,” says Montaner, 
who is at the BC Centre for Excellence in 
HIV/AIDS in Vancouver (an offshoot of 
the University of British Columbia). He ac- 
knowledges that he and his colleagues were 
relying more on mathematical models than 
hard evidence that treatment as prevention 
worked. But studies vindicated him. “I’m 
happy we were right.” 

Although BC has not declared an explicit 
goal for “ending AIDS,’ Montaner says it is 
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committed to reaching the 90-90-90 target 
by 2020. Nearly seven times the size of New 
York state, the province has only 4.6 million 
residents, an estimated 12,000 of whom are 
infected with HIV, giving it a prevalence rate 
of 0.3%. The epidemic boomed in the early 
1990s in men who have sex with men (MSM). 
When heroin use surged around 1996 in 
downtown Vancouver, HIV infections did 
as well, which led authorities to declare a 
public health emergency the next year. The 
government expanded a long-running meth- 
adone maintenance program, distributed 
free needles, and in 2003 opened the first su- 
pervised injection site in North America. In 
2011, the province began offering ARVs to all 


Treatment cascade 

For the treatment-as-preventwion strategy 
to work, people must know their HIV 
status, receive appropriate medical care, 
and take ARVs regularly such that their 
HIV becomes undetectable. 
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with HIV to care Undetectable HIV 


infected people, whereas most of the world, 
concerned about cost and side effects, still 
recommended withholding treatment until a 
person’s immune system was damaged. 

With the widespread use of ARVs, 
AIDS-related illness and death in BC has 
plummeted since 1996. New HIV diagnoses 
have fallen from a high of 681 per year in 
1995 to 262 in 2014, with a huge drop among 
people who inject drugs. But among MSM 
and heterosexuals, the number of new di- 
agnoses has barely budged for several years. 
And although Vancouver has been relatively 
successful, the epidemic is poorly controlled 
in more remote regions of the province. 

In BC as a whole, some 80% of infected 
people know their HIV status, 61% take 
ARVs at some point, but only 51% have un- 
detectable viral levels—the cornerstone of 
the treatment-as-prevention strategy. “Treat- 
ment as prevention has not been pushed to 
the limit,’ Montaner says. So the BC center 
and its collaborators have recently stepped 
up efforts to find infected people and make 
sure everyone who starts ARVs stays on 
them so the virus is fully suppressed. 
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Following a surge in HIV infections from sharing needles, Vancouver opened Insite in 2003 for drug users to inject safely. 


As part of its revved-up effort, BC is one of 
a growing number of places that is exploiting 
DNA sequencing of the fast-evolving virus to 
work out how isolates are related. That al- 
lows epidemiologists to pinpoint hot spots of 
transmission, where interventions can then 
be targeted (Science, 12 June, p. 1188). These 
data have led health authorities to concen- 
trate outreach efforts in the northern, rural 
part of the province, which is home to many 
indigenous Canadians. 

Some criticize BC’s program for not going 
beyond 90-90-90 and offering PrEP to high- 
risk but uninfected people like Shane Ryan, a 
mainstay of efforts in New York and San Fran- 
cisco. “I don’t think we’re going to eliminate 
the epidemic without PrEP,” 
says Susan Buchbinder, who 
directs HIV prevention at the 
Department of Public Health 
in San Francisco. Montaner, 
who notes that Canada’s gov- 
ernment-funded health care 
plan does not reimburse for 
PrEP, is not convinced PrEP 
is needed. But he says he is 
keeping an open mind. If in 
5 years the jurisdictions that 
combine both strategies are 
ahead of us, he says, “I'll say 
PrEP is the path to take.” 


MARK HARRINGTON and 
Charles King hatched their 
bold plan for ending the 
AIDS epidemic in New York 
in the back of a paddy 
wagon. In 2012, during an in- 
ternational AIDS conference 
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in Washington, D.C., the prominent activ- 
ists were arrested outside the White House 
while protesting the Obama administra- 
tion’s HIV/AIDS policies. Based in New York, 
Harrington runs Treatment Action Group, 
an HIV/AIDS think tank, and King is head of 
Housing Works, which helps find homes for 
people living with the virus. Convinced that 
the administration’s plan was not aggressive 
enough, they began brainstorming while en 
route to their jail cell about how they could 
do better. 

With support from HIV/AIDS research- 
ers at Columbia University and the AIDS 
Institute at the New York State Department 
of Health, Harrington and King organized 


On Gay Pride Day in June 2014, New York Governor Andrew Cuomo (D), shown marching 
in a Manhattan parade, announced a plan to end AIDS in the state. 
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community meetings to explore how New 
York could drive down new infections to 
such a low level that the epidemic would 
peter out. “It brought more gravitas to the 
whole thing that it was researchers, advo- 
cates, and policymakers all working to- 
gether to push for this,” says psychologist 
Robert Remien, who heads Columbia’s HIV 
Center for Clinical and Behavioral Studies. 

The idea received an enormous political 
boost in June 2014 when New York Governor 
Andrew Cuomo, a Democrat, announced a 
three-point plan to end the state’s AIDS epi- 
demic by expanding testing, treatment, and 
access to PrEP. A few months later, Cuomo set 
up an Ending the Epidemic (ETE) Task Force, 
which includes Harrington, 
King, other community ad- 
vocates, scientists, and gov- 
ernment health workers. In 
April 2015, at a ceremony to 
unveil the group’s blueprint, 
Cuomo said that when he 
announced the goal, people 
thought it was “outrageous.” 
But New York will suc- 
ceed and set an example, 
said Cuomo, who budgeted 
$10 million this year to sup- 
port the ETE project. 

The meaning of “ending 
AIDS” varies from place to 
place, and the “Defeating 
AIDS” report urges the inter- 
national AIDS community to 
agree on “a precise scientific 
and epidemiological defini- 
tion of low-level endemicity.” 
The report defines the end 
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as a reproduction number below one, mean- 
ing that each infected person transmits the 
virus to less than one other person on aver- 
age. At that rate, the epidemic will gradually 
fade. No locale has adopted that exact lan- 
guage. Governor Cuomo offered this defini- 
tion: “The end of the AIDS epidemic in New 
York state will occur when the total number 
of new HIV infections has fallen below the 
number of HIV-related deaths.” In 2012, the 
state had an estimated 3000 new infections 
and 1653 AIDS deaths (with an ongoing de- 
bate about how many of those were HIV re- 
lated). The ETE blueprint aims to reduce the 
number of new infections to 


750 in 2020. 
New York has more HIV- 
infected residents, about 


half of whom are MSM, than 
any state. Out of a popula- 
tion of nearly 20 million, an 


estimated 154,000 people Se e 


were living with HIV in 
2012. Of those, 132,000, or 
89%, knew their status, but 
only 68,000, or 44%, were 
on treatment and had un- 
detectable levels of virus, far 
below the UNAIDS target. 
To decrease the gap between 
diagnosis and effective treat- 
ment, which Harrington 
refers to as “the Grand Can- 
yon,’ the blueprint calls for 
stepping up surveillance to 
find people who know their 
status but don’t consistently 
use ARVs and to assist them 
with nonmedical challenges 
that might get in the way 
of treatment, such as find- 
ing housing and jobs. It 
will also provide incentives 
to both care providers and 
patients to help people stay 
on treatment. 

Meanwhile, 22,000 New 
Yorkers don’t even know 
they are infected. To reach 
them, the state will ramp up 
routine testing in places like 
emergency rooms and send 
out more mobile testing units to high-risk 
populations, including MSM, transgender 
people, new immigrants, the homeless, and 
the mentally ill. ETE is launching a statewide 
education campaign about PrEP—which has 
been approved by the U.S. Food and Drug 
Administration for 3 years but has been slow 
to catch on, in part because some uninfected 
people worried that it would brand them as 
promiscuous and reckless. New York is also 
creating programs to help people access 
PrEP drugs. Medical record “autopsies” of 
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people who die from AIDS will try to identify 
how they slipped through the cracks. 

The state’s portion of the price tag for put- 
ting 68,000 more people on treatment by 
2020 is $2.25 billion, according to a recent 
analysis done by Treatment Action Group 
and Housing Works. Additional housing 
support for the 12,000 or so HIV-infected 
people who are homeless or poor would add 
as much as $720 million to the bill. But im- 
proved care and decreases in new infections 
would reduce the load on the health care 
system by enough to save New York nearly 
$8 billion, the analysis finds. 


New York City’s hot spots of spread 
With genetic sequences of HIV from different people, researchers can 
identify how the virus moves between individuals (circles). Mapping these 
transmission clusters can help target treatment and prevention efforts. 
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ON WORLD AIDS DAY in 2013, Diane 
Havlir of the University of California, San 
Francisco (UCSF), moderated a forum for 
the local community entitled “Getting 
to Zero in San Francisco: How Close Are 
We?” After experts described their city- 
wide efforts, Havlir, a clinician who heads 
the HIV/AIDS division at San Francisco 
General Hospital, invited questions from 
the audience. 

“You guys are all doing great things,’ a 
man said. “Are you working together?” 
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Havlir was taken aback. “At that moment 
we recognized that we were really not work- 
ing together,” she said recently. That discus- 
sion sparked the Getting to Zero Coalition, 
which bands together UCSF, the city’s health 
department, HIV/AIDS advocacy groups, 
and major hospitals. In January, the coali- 
tion released a strategic plan for ending the 
city’s epidemic. 

Although San Francisco has just 20% the 
population of BC, it is home to more HIV- 
infected people—an estimated 15,901 at the 
end of 2013, an adult prevalence of about 
2%. One of the first cities to be hard-hit by 
HIV, San Francisco early 
on developed a strong re- 
sponse by combining the 
research and clinical re- 
sources of UCSF and the lo- 
cal health department. The 
city’s epidemic surveillance 
is sophisticated and granu- 
lar, including mapping of 
viral load levels by neigh- 
borhoods, which reveals 
where people are not con- 
trolling their infections. In 
2010, San Francisco became 
the first jurisdiction to rec- 
ommend immediate treat- 
ment for all HIV-infected 
people. Three years later, 
San Francisco General Hos- 
pital went a step further 
and launched a RAPID pro- 
gram that starts people on 
treatment the very day they 
are diagnosed, if possible. It 
also sends outreach teams 
to the streets to search for 
patients who miss appoint- 
ments (Science, 13 July 2012, 
p. 175). Researchers at UCSF 
and the health department 
pioneered PrEP studies. 

Exceptional as San Fran- 
cisco’s response has been, 
it also has serious short- 
comings, which the stra- 
tegic plan confronts with 
three “signature initiatives”: 
scale up RAPID citywide, 
teach people about PrEP and make it easy 
to get—a la Shane Ryan’s visit at Magnet— 
and work more diligently to keep people tak- 
ing their ARVs. Testing is a not a big issue, 
as some 94% of infected people in the city 
know their status. But whereas 89% of those 
people see a specialist within 90 days, only 
63% start on ARVs and achieve undetectable 
viral loads for prolonged periods. The Get- 
ting to Zero Coalition hopes to end AIDS by 
bringing that 63% up to 90% by 2020—an 
even more ambitious goal than the 73% set 
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No end in sight 


By Jon Cohen, in Tijuana, Mexico 


n 6 December 2013, Pedro Robles 

spent 14 hours in an ambulance 

being driven up Mexico’s Baja 

California Peninsula. The 51-year- 

old man was not rushed north for 
emergency medical care. Time was not of 
the essence. 

Robles had an advanced case of AIDS, 
and he was being driven 1127 kilometers 
north from his home in Loreto to 
Albergue Las Memorias A.C. in Tijuana. 
A nongovernmental organization (NGO) 
arranged the trip, because Las Memorias 
is the only AIDS hospice on the entire 
Baja California Peninsula, Robles was 
broke, and Tijuana held out the remote 
hope that someone there could navigate 
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the medical bureaucracy and maybe save 
his life. But Las Memorias itself, which 
also serves as a drug rehabilitation center 
and is largely run by its residents, has 

no trained medical staff. And although 
Las Memorias did what it could to make 
Robles comfortable, Tijuana ultimately 
failed him: He died 6 days later without 
ever having seen a doctor. 

As the dream of ending AIDS catches 
hold in a growing number of locales (see 
main story, p. 226), Tijuana is hardly 
anomalous: Many places are still strug- 
gling to provide basic treatment and 
prevention services. Of course, people still 
die from AIDS in wealthy countries like 
the United States, which is visible from 
downtown Tijuana, but appropriate care 
is so readily available that AIDS hospices 
shut their doors years ago. 

Like Tijuana, too many locales appear to 
be “running at a standstill” and are saddled 
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by “poor strategy, absence of leadership, 

or inadequate resources,” laments a promi- 
nent commission in a report in The Lancet 
last month, “Defeating AIDS-advancing 
global health.” 

The drop in HIV infections and AIDS- 
related deaths worldwide over the past 
dozen years has been impressive, the 
report says. But without a “massive and 
rapid expansion of a comprehensive AIDS 
response,” the global toll—still more than 
a million new infections and deaths each 
year—will worsen again over the next 
5 years, and the world will fail to reach 
the United Nations goal of “ending AIDS 
as a public health threat” by 2030. 

Mexico is not particularly hard-hit by 
HIV. In 2014, UNAIDS estimated the coun- 
try had 190,000 infected people, which is 
an adult prevalence of 0.2%—lower than 
in the United States. The government 
offers free antiretrovirals (ARVs) and, since 
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November 2014, has recommended that 
all HIV-infected people receive them as 
soon as diagnosed. “When you look back 
to what we were doing 10 years ago, we 
are really, really better,’ says Carlos Magis- 
Rodriguez of the National Center for HIV/ 
AIDS Prevention and Control in Mexico 
City. But he acknowledges that Tijuana 
and other cities in Mexico are struggling. 
Tijuana has what is known as a 
“micro-hyperepidemic.” Overall HIV 
prevalence in Tijuana is 0.6%—the same 
as the United States. But the rate is 
soaring in high-risk groups. In women 
who sell sex, prevalence jumped from 
2% in 2003 to 6% by 2012, according 
to recent studies by researchers from 
the University of California, San Diego 
(UCSD), whose team includes Mexico- 
based colleagues. Clients of these sex 
workers had a prevalence of 5%, they 
found. HIV prevalence is also about 
5% in the many people in Tijuana who 
inject drugs. Preliminary studies of men 
who have sex with men and transgender 
people suggest about 20% are infected. 
In theory, Tijuana should be able to 
rein in its concentrated epidemic by 
taking advantage of recent advances. 
Key among them is the 2011 demonstra- 
tion that people who fully suppress their 
HIV levels with ARVs rarely spread the 
virus to their sexual partners. But this 
treatment-as-prevention strategy has not 
gained much traction in Tijuana. UCSD 
behavioral health scientist Laramie 
Smith recently pooled data from six stud- 
ies of nearly 200 HIV-infected people in 
Tijuana and found that only about half 
even knew they had the virus. Tijuana 
offers free HIV testing through NGOs 
and government-funded clinics, yet no 
plan is in place to regularly test high-risk 
people at venues where they hang out, 
like gay bars or the red light district. 
Those who do learn they're infected 
rarely get treatment. In Smith’s study, 
only 11% received related medical care, 
and only 3.66% began taking ARVs. The 
federally sponsored HIV/AIDS clinic, 
CAPASITS, provides free ARVs, but it 
is located far from the downtown area 
and is difficult for many people to reach. 
Tijuana is also a hub for migrants, 
including many deportees from the 
United States, and some do not have 
the documents required to receive help 
at CAPASITS. And the services there 
fall short of those in developed coun- 
tries: CAPASITS, for example, must ship 
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patient blood samples to Mexico City for 
measurements of CD4 lymphocyte counts 
and HIV levels. 

José Luis Burgos, a Tijuana-based 
clinician who works with the UCSD team, 
says a key problem is that the patient 
load in Tijuana outstrips the availability 
of qualified HIV/AIDS doctors. Burgos 
contends that Tijuana could train primary 
care physicians to diagnose and treat HIV/ 
AIDS patients. “You need to demystify 
HIV care,” he says. CAPASITS, he notes, 
has only three doctors and treats some 
1000 patients. “What kind of care can you 
expect from three providers?” 

Paradoxically, Mexico’s rising economic 
status is hampering the fight. A 2011 grant 
from the Global Fund to Fight AIDS, 
Tuberculosis and Malaria enabled two 
Tijuana NGOs to launch mobile needle- 
exchange units. But the grant ended in 
2013 when Mexico achieved upper middle 
income status and became ineligible for 
Global Fund support. Tijuana’s needle- 
exchange programs shriveled overnight. 
“Mexico is supposed to be upper middle 
income, but the border isn’t,’ Burgos says. 

The UCSD team soon documented a 


“Mexico is supposed to 
be upper middle income, 
but the border isn’t.” 


José Luis Burgos, 
University of California, San Diego 


40% increase in needle-sharing among 
a group of users that it has closely fol- 
lowed. That undermines other efforts to 
reach people with HIV, says UCSD epide- 
miologist Steffanie Strathdee, who leads 
the binational research program with her 
psychologist husband Thomas Patterson. 
“The sad thing” is when drug users come 
in for needle exchange, she says, “they 
have an opportunity to get HIV testing or 
a referral to a drug treatment program.” 
Strathdee hopes the group’s extensive 
research will draw attention to the prob- 
lems and the opportunities in Tijuana. 
“Tt’s entirely possible to end the AIDS 
epidemic in Tijuana,” she says. And if 
Tijuana can do it, so can much of the rest 
of the world. = 


Reporting for this story was supported in 
part by a grant from the Pulitzer Center 
on Crisis Reporting. 
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by 90-90-90. The city’s ending AIDS targets 
also include reducing new HIV diagnoses 
from 371 in 2013 to 37 in 2020 and HIV-re- 
lated deaths from an estimated 91 to eight. 

“The biggest challenges are reaching 
into marginalized populations that are not 
getting the services they need, says the 
health department’s Buchbinder—transgen- 
der people, drug users, African-Americans, 
and the young. Getting to Zero is massively 
expanding the use of “navigators” assigned 
to infected people to help address barriers 
that interfere with care, such as substance 
abuse, food insecurity, homelessness, and 
violence. A new program will better coordi- 
nate medical records at different providers 
to help identify patients who are slipping 
through the net. And HIV-infected patients 
deemed at high risk of not taking ARVs will 
receive extra check-in phone calls and re- 
minders for appointments. 

Rigorous evaluation is critical for the cam- 
paign to succeed, said Havlir at a coalition 
meeting in June. Getting to Zero is raising 
money specifically to track the program’s 
performance, using novel metrics like as- 
sessing the impact of PrEP on new infection 
rates. “This is not just talking, talking, talk- 
ing,” Havlir said. “This is about action.” 


THE DRIVE TO END AIDS is spreading 
worldwide, and there is even something of 
a good-natured race to be first. Washington, 
D.C., New South Wales in Australia, and Bra- 
zil are now in the running as well, and San 
Francisco has attracted intrigued delegations 
from Amsterdam, France, and the White 
House’s Office of National AIDS Policy. 

From his office at the London School of 
Hygiene & Tropical Medicine, which he di- 
rects, epidemiologist Peter Piot is watching 
these efforts with interest—and some skep- 
ticism. “It’s very important that these proj- 
ects proceed and that we learn from them,” 
says Piot, who chaired the “Defeating AIDS” 
commission and formerly headed UNAIDS. 
But he cautions that the intensive efforts 
in BC, New York state, and San Francisco 
must continue indefinitely. “These three 
examples are not North Korean types of 
islands—there will be constant reintroduc- 
tion of the virus,” he notes. 

Ending the global spread of HIV will ul- 
timately take a vaccine, Piot says, stressing 
that treatment as prevention packs a lim- 
ited punch. He points to a mathematical 
model in “Defeating AIDS” that found that 
even if the world achieves the UNAIDS 90- 
90-90 goal in 2030, hundreds of thousands 
of new HIV infections and deaths will still 
occur each year. “We’ll have to see whether 
these three places can end AIDS as a public 
health threat,’ Piot says. “But that doesn’t 
mean one shouldn’t try.” & 
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Electron microscopy provides atomic-reso 
structures of nanoparticles in solution 


By Christian Colliex 


A triumph of electron microscopy. Park et al. use a specially equipped electron microscope to determine the 
atomic structure of individual platinum nanoparticles in solution. 
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anoparticles containing a few tens 
or hundreds of atoms have applica- 
tions in an ever-expanding range of 
areas, from electronics to catalysis 
and biological sensors. This versatil- 
ity stems from the high sensitivity 
of nanoparticle properties to size, chemical 
composition, and shape, which are largely 
determined by the synthesis route by which 
they are produced. Nanoparticles grown in 
solution, termed “colloidal nanoparticles,” 
are of particular interest because they may 
be used in many applications, including as 
biosensors (7). On page 290 of this issue, 
Park et al. (2) report high-resolution struc- 
tures of nanoparticles in solution deduced 
from electron microscopy. This spectacular 
result relies on the convergence of recently 
developed techniques from both the physi- 
cal and life sciences and heralds a new era of 
high-resolution studies in solution. 

Colloidal particles made of noble metals, 
particularly gold or platinum, have been 
widely studied for applications in catalysis 
and photocatalysis. As early as 20 years ago, 
Ahmadi et al. showed that the shape of col- 
loidal Pt nanoparticles could be controlled by 
changing the composition and concentration 
of capping materials and metal cations in the 
solution (3). To image the particles, they de- 
posited nanoparticles on an amorphous car- 
bon layer. By tilting the resulting specimen, 
they found that nanoparticles with square 
shapes were cubic and nanoparticles with 
triangular shapes were tetrahedral. However, 
they could not resolve the three-dimensional 
(3D) atomic structure or study the particles 
directly in solution. This is what Park ez al. 
now achieve for individual nanocrystals. The 
work follows similar progress realized a few 
years earlier in a gaseous environment (4). 

The success of the study by Park et al. re- 
lies on three technological developments. 
The graphene liquid cell (GLC), introduced 
previously by the same group (5), maintains 
nanometer-scale objects in a liquid environ- 
ment between two layers of free-standing 
graphene. Highly sensitive direct electron 
detectors enable the production of movies 
with about 50 frames per second. To analyze 
their data, the authors adapted the theory 
and associated software tools created by mo- 
lecular biologists to reconstruct the struc- 
tures of single biomolecules. 

In the approach used by Park et al., mov- 
ies follow individual Pt nanocrystals freely 
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rotating in solution (see the figure, panel A). 
These nanoparticles may thus be observed 
in many different orientations, providing 
the required number of different projec- 
tions for the reconstruction algorithm. In 
molecular biology (see the figure, panel B), 
multiple copies of biomolecules with identi- 
cal structures are flash-frozen in a vitrified 
thin layer of ice (6). One micrograph covers 
a field with hundreds of particles viewed at 
different orientations. These images of in- 
dividual molecules yield a collection of 2D 
projections that are then used to generate a 
single 3D structure. 


A Nanoparticle 


Electrons 
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Freely rotating nanoparticle in GLC 


Reconstructed 
3D object 


Projected 2D image 


Three-dimensional structures from electron microscopy. (A) Park et a/. take many images of an individual platinum nanoparticle as 
it rotates freely in the solution contained in a GLC. They then combine these images of the same particle to determine its 3D structure. 
The method was adapted from cryo-EM (B), where 2D proejctions of numerous copies of the same biomolecule frozen in a vitrified ice 


tron energy-loss spectroscopy (EELS) to map 
the spatial distribution of Fe, O, and N ele- 
ments and also reported an analysis of the 
iron valence state in ferritin and in solution. 
The results show that GLCs provide a poten- 
tially useful route for monitoring biochemi- 
cal reactions in situ. 

As for the latest generation of direct 
electron-detection camera technology, it 
has contributed to the rapid improvement 
of high-resolution structures reconstructed 
from cryo-electron microscopy (cryo-EM) 
data. Their signal-to-noise ratio is improved 
in the movie frame mode, allowing refined 
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nanocrystals can generally be considered as 
rigid, as long as the electron beam does not 
cause any transformations. 

The reconstruction of 3D atomic struc- 
tures of individual metallic nanoparticles in 
a vacuum or embedded in a solid matrix has 
benefited from advances in electron micros- 
copy techniques, such as aberration correc- 
tion for higher-resolution images, annular 
dark-field scanning transmission mode for 
counting the number of atoms in each col- 
umn, and the development of dedicated algo- 
rithms for tomography. Successful examples 
include the 3D atomic structures of a gold 
nanoparticle in vacuum (J/, 12), 
a silver nanocrystal epitaxially 
embedded in an aluminum ma- 
trix (13), and a gold nanoparti- 
cle of 68 atoms surrounded by a 
thiol molecular cage that makes 
it water soluble (14). Park e¢ al. 
now investigate two distinct 
individual nanoparticles in 
the liquid phase. Their density 


Similar biomolecules randomly oriented in ice 


Projected 2D image 


layer are used to determine the 3D structure of the biomolecule (15). 


Yuk et al. first used a GLC to image with 
atomic resolution a few critical steps (coales- 
cence, reshaping, and faceting) in the growth 
of Pt nanocrystals (5). Since then, GLCs have 
been used to follow the real-time trajectories 
of Au nanocrystals linked to double-stranded 
DNA fragments. The observations confirmed 
that sample-substrate interactions between 
the liquids and the inert graphene layers are 
minimized (7). The authors then deduced 
the 3D motion from a series of 2D images re- 
corded while the complex nanostructure was 
rotating. This was clearly a key step on the 
way to the present demonstration. Recently, 
Wang et al. used GLCs to encapsulate tiny 
water volumes containing ferritin molecules, 
which consist of a core of iron (hydro)-oxide 
and a protein shell (8). They then used elec- 
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compensation of sample movements and 
reducing the number of particles neces- 
sary for reconstructing the structure of a 
biomolecule (9). Very recently, Bartesaghi 
et al. have demonstrated the structure of a 
ligand-protein complex at 0.22-nm resolu- 
tion (JO), quite close to that demonstrated 
by Park et al. 

The algorithms used to reconstruct 3D 
structures from extensive sets of 2D projec- 
tions have been developed over the past two 
decades, mostly for applications in biology. 
One basic assumption is that the population 
of imaged particles is uniform. However, this 
assumption is generally not fully satisfied be- 
cause biomolecules are by nature not rigid. 
Additional computing techniques are thus 
used to sort the images into different classes 
corresponding to different conformations. 
This issue is not as relevant in Park et al’s ap- 
proach, where the same particle is viewed in 
different orientations. Furthermore, metallic 
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maps reveal that each particle 
consists of three nanocrystals 
joined together nonsymmetri- 
cally. The authors analyze the 
local organization of the inter- 
faces between these nanocrys- 
tals and compare the results 
with molecular dynamics calcu- 
lations of the free energies for 
grain boundary formation. 

As Park et al. show, the 
structure and dynamics of 
individual members of an in- 
homogeneous ensemble of 
nano-objects in a fluid can 
now be determined with spe- 
cifically equipped modern elec- 
tron microscopes. This advance 
provides a tool for exploring the growth, 
self-organization and reactivity of nanopar- 
ticles in liquids containing different types 
of ions, salts, or molecules. & 
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Magnetic bubbles with a twist 


Individual skyrmionic bubbles can be generated and moved at room temperature 


By Kirsten von Bergmann 


resent-day hard _ disk 

drives use magnetic bits 

that are read out by mov- 

able read heads. To cir- 

cumvent the inherent 

fragility of such a me- 
chanical construction, and to 
exploit the third dimension for 
increased storage density, Par- 
kin et al. proposed a racetrack 
memory device (7). The general 
concept is that the information 
is encoded in a localized magne- 
tization configuration that can 
be driven through the material 
with electrical currents and transported to a 
stationary read head. Recently, interface-in- 
duced skyrmions, which are circular particle- 
like magnetic objects (see the figure, panel 
A), have been envisioned as ideal candidates 
for future racetrack memory-type applica- 
tions (see the figure, panel B) (2, 3). On page 
283 of this issue, Jiang et al. (4) have made 
progress toward realizing such a device ar- 
chitecture. They report on the generation 
and movement of individual skyrmionic bub- 
bles at room temperature, accomplished by 
exploiting two different spin-orbit coupling- 
related effects. 
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Magnetic skyrmions are localized non- 
collinear states with a twist, possessing the 
key characteristic that they are topologically 
protected. That is, they effectively constitute 
a knot in the magnetic texture that adds to 
their stability with respect to a trivial fer- 
romagnetic state. The application of spin 
transfer torques by the injection of lateral 
or vertical electrical currents can induce a 
motion of magnetic skyrmions at surpris- 
ingly low current densities (5, 6). For such 
topologically distinct states, the direction 
of motion depends on the direction of the 
twist, manifested in the rotational sense of 
the spin canting. 

Twisted magnetic states are useful for 
applications only if the direction of the 
twist is unique, thus enabling control over 
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the movement via the direc- 
tion of the injected current. 
To achieve this, the twist must 
be intrinsic to the system. A 
magnetic interaction that fa- 
vors one rotational sense over 
the other is the Dzyaloshin- 
skii-Moriya (DM) interaction, 
which arises as a result of spin- 
orbit interaction and can occur 
in systems with broken inver- 
sion symmetry as, for example, 
in chiral crystal structures 
(7, 8) or at interfaces (9-11). 
For the interface-induced sky- 
rmions, the rotational sense 
is determined by the specific 
material, and when realized on a large scale 
by standard epitaxial growth processes, 
it promises an easy implementation into 
present state-of-the-art multilayer-based 
device architectures. 

Until now the only experimental observa- 
tions of interface-induced skyrmions were 
in perfectly ordered, atomically thin lay- 
ers, which served as model systems (10, 11). 
Tuning of the skyrmion size from 1 to 5 nm 
was demonstrated. By using local currents 
from a scanning tunneling microscope tip, 
isolated skyrmions were written and de- 
leted individually (17). Future applications 
will rely on other, more practical materials 
in which the DM interaction governs the 
magnetic texture, inducing a defined twist 
of the magnetic state. 


Skyrmions for future storage devices. (A) Sketch of a magnetic skyrmion, a knot in the magnetization with a defined twist. (B) Coherent movement of skyrmions along a 

track with lateral currents inspired by (3). (C) Generation of individual skyrmions by inhomogeneous spin-orbit torques (small blue arrows) behind a geometrical constriction, 
as demonstrated by Jiang et al. (4). (D) Annihilation of skyrmions by small currents that drive the magnetic object out of the magnetic material. (E) Stationary read head; the 
information is encoded in the magnetic texture, which is moved to the readout by small lateral currents along the track. 
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Jiang et al. report on the observation of 
interface-induced skyrmionic bubbles at 
room temperature in a magnetic layer grown 
epitaxially on a material with large spin-orbit 
coupling. The role of the substrate is twofold: 
First, it provides the interface at which the 
DM interaction is induced; and second, by 
driving a lateral current through the sub- 
strate, the charge is converted to a vertical 
spin current due to the spin Hall effect (72). 
The resulting spin-orbit torques act on the 
magnetic layer and induce a movement of 
the micrometer-sized skyrmions. All of them 
move in the same direction, which means 
they all have the same twist, imposed by the 
DM interaction. 

The generation of individual skyrmions 
is realized by inhomogeneous spin-orbit 
torques occurring behind a geometric con- 
striction (see the figure, panel C). Jiang et al. 
propose an intriguing model of how the bub- 
bles are pinched off from a magnetic stripe 
domain driven through the constriction, and 
draw an analogy to the formation of soap 
bubbles. Thus, for the writing of a skyrmion 
into a track, a perpendicular geometry could 
be used, and with a lateral movement in the 
same direction, it should be possible to delete 
single magnetic objects by moving them out 
of the track (see the figure, panel D), unifying 
all relevant operations in one device. 

With the goal of achieving a skyrmion 
racetrack in sight, there are several issues 
that still need to be resolved. The precision 
of the generation and the degree of control 
over each magnetic skyrmion need improve- 
ment. Jiang et al. demonstrate the generation 
of many skyrmionic bubbles at a time, and in 
their sample the movement is strongly influ- 
enced by local pinning due to the inhomoge- 
neity of the film. Furthermore, the size of the 
skyrmions needs to be reduced to be compet- 
itive with present storage densities. Future 
research will show whether nanometer-sized 
skyrmions are possible in materials relevant 
for applications. However, such a small size 
of a skyrmion will not only limit the meth- 
ods to investigate them but also challenge 
the mechanism and design of an appropriate 
readout device (see the figure, panel E). m 
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Is biodiversity good for 


your health? 


Disease incidence is often lower in more diverse 
communities of plants and animals 


By Felicia Keesing and Richard S. Ostfeld 


n a floodplain of the River Saale near 
Jena, Germany, grassland plants are 
naturally bombarded by spores of 
pathogenic fungi. But whether or not 
those fungi cause infection turns out 
to be largely about the neighborhood: 
Plants on highly diverse experimental plots 
have much lower levels of infection than 
plants grown in monoculture (7) (see the 
photos). The pathogens, it appears, are less 
likely to encounter their optimal host on the 
more diverse plots, which reduces disease 
prevalence and incidence. This protective 
effect of diversity has been found in many 
studies, not just for plants but also for dis- 
eases afflicting humans and wildlife. It has 
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The benefits of diversity. Civitello et a/. have performed a meta-analysis of studies of 61 parasites infecting both 


remained unclear, however, whether this ob- 
servation holds generally (2, 3). In a recent 
paper, Civitello et al. addressed this question 
in a rigorous meta-analysis of diversity-dis- 
ease relationships (4). 

Civitello et al. examined 202 results from 
studies of 61 parasites infecting both plants 
and animals. They found a clear and consis- 
tent reduction in disease prevalence when 
diversity was high. The effect was as strong 
for parasites that infect humans as for those 
that infect only wildlife and was seen both 
for microparasites (pathogens such as bac- 
teria and viruses) and for macroparasites 
(such as trematodes and nematodes). The 
protective effect of diversity for human dis- 
eases is particularly important because it re- 
futes the results of a previous meta-analysis 
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animals and plants. The results show that disease prevalence is often higher in less diverse systems (top, barley 


monoculture) than in more diverse systems (bottom). 
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Effects of plant diversity on fungal pathogens in Jena, Germany. Plots with higher host diversity generally had a hig! 


overall infection (B). Drawn from data provided in (1). Note 


of only six parasites, which found no consis- 
tent effect of diversity (5). Civitello et al. also 
found that the effect of diversity was equally 
strong for both experimental and compara- 
tive studies. This is a key result, because al- 
though it has been known for some time that 
diversity can reduce disease in carefully con- 
trolled and constructed experimental com- 
munities, it has been less clear whether the 
effect translates into natural systems. 

The widespread negative effect of high di- 
versity on disease transmission documented 
by Civitello et al. suggests a consistent un- 
derlying mechanism that applies to diseases 
of humans, wildlife, and plants. One part of 
that mechanism has been hinted at in pre- 
vious work: The best hosts for multihost 
pathogens are often abundant, widespread, 
and resilient species (6-8). A recent study 
by Han et¢ al. (9) provides strong evidence 
for this pattern. The authors set out to de- 
termine what characteristics make some 
species good reservoir hosts of diseases that 
infect humans. Working with a database of 
2277 rodent species and 66 pathogens, they 
used machine-learning algorithms to explore 
what aspects of host life history, physiology, 
behavior, ecology, and biogeography are asso- 
ciated with pathogens that spill over into hu- 
man populations. Several patterns emerged. 
Good reservoir hosts have broad geographic 
ranges that contain comparatively few other 
species. They also reach maturity at a young 
age and have a short gestation period and a 
large litter size, all characteristics of what is 
often called a fast life history. 

Why might species with fast life histories 
be good hosts for pathogens? Two hypoth- 
eses have garnered some support. First, 
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parasites and pathogens that infect multiple 
species might evolve to exploit those hosts 
that they are most likely to encounter in na- 
ture—in other words, the most widespread, 
abundant hosts. Species that are widespread 
and abundant often have fast life histories 
(0). The second possibility is that fast-living 
hosts tend to be less resistant to, or more 
tolerant of, infection because of how they al- 
locate their immune defenses (11). 

Whatever the causes of these patterns, 
the consequences are intriguing. Species 
with fast life histories are often resilient to 
disturbance and are thus likely to persist 
in the face of environmental stressors that 
cause other species to decline or disappear 
(12). Together, these two patterns—the resil- 
ience and ubiquity of species with fast life 
histories and their odds of being high-qual- 
ity hosts for parasites and pathogens—may 
interact to produce the diversity-disease re- 
lationship. As biodiversity is lost from eco- 
logical systems, the species most likely to 
persist may tend to be those most likely to 
harbor and transmit pathogens at high rates. 

But won't ecological communities with a 
higher diversity of hosts also have a higher 
diversity of parasites and pathogens? And 
might a higher diversity of parasites and 
pathogens counteract the disease-suppress- 
ing effects of high host diversity? These 
questions have been at the heart of critiques 
of the diversity-disease relationship (3, 73). 
An answer may be emerging. Back on the 
German floodplain, Rottstock et al. (1) found 
that the diversity of fungal pathogens was 
indeed higher in the plots with higher plant 
diversity, but the level of infection was lower 
(see the chart). Even the prevalence of co- 
infection—the simultaneous infection of 
a host with more than one pathogen—was 
lower where host and pathogen diversity 
were high. If this pattern of reduced disease 
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with increased parasite diversity occurs in 
other systems, it would help to resolve what 
some have seen as a paradoxical relationship 
between diversity and disease. 

Many fascinating questions remain about 
the relationship between diversity and dis- 
ease, but Civitello et al’s meta-analysis dem- 
onstrates that diversity frequently reduces 
disease. At the same time, Han et al. (9) have 
identified traits in rodents that strengthen 
the connection between life history and the 
probability that a species will be a reser- 
voir for human pathogens. Case studies to 
evaluate diversity effects in specific disease 
systems will remain important, as will the 
pursuit of mechanisms that underlie the 
general pattern. Perhaps the most impor- 
tant question that remains is about the ap- 
plication of this knowledge to public policy. 
Should health protection be added to the 
long list of ecosystem services provided by 
biodiversity (2)? The meta-analysis by Civi- 
tello et al. suggests that it should. m 
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IMMUNOLOGY 


Neutrophil-macrophage communication in 
inflammation and atherosclerosis 


Neutrophils may license macrophages to respond to cholesterol crystals and drive 
inflammation that aggravates atherosclerosis 


By Matthias Nahrendorf 
and Filip K. Swirski 


eutrophils and monocyte-derived 
macrophages are myeloid cells, with 
a shared hematopoietic ancestry, that 
pursue both common and distinct 
immune functions. In response to 
tissue damage or infection, neutro- 
phils infiltrate first, followed by monocytes 
(1). Through a series of events involving the 
release of proinflammatory products, both 
cell types seek to neutralize danger, and 
if successful, inflammation resolves. But 
when stimuli persist and inflammation is 
not resolved, neutrophils and monocytes 
continue to accumulate, presumably in- 
flicting the irreversible damage that char- 
acterizes chronic inflammation. Whether 
neutrophils and monocytes act together to 
fuel the inflammatory cascade in those cir- 
cumstances is still poorly understood. On 
page 316 of this issue, Warnatsch et al. (2) 
report that neutrophils prime macrophages 
for inflammatory responses that aggravate 
atherosclerosis. Neutrophil extracellular 
traps (NETs) underlie the communication 
between these two immune cell types. 

A distinguishing feature of neutrophils is 
their ability to expel cytosolic and nuclear 
material, a suicidal act (NETosis) that en- 
snares extracellular microbes. NETs are 
web-like structures containing DNA, his- 
tones, neutrophil elastase, and myeloperox- 
idase (3) (see the second figure). According 
to Warnatsch et al., NETs can be harmful in 
the setting of inflammatory atherosclerosis. 
In this case, extracellular cholesterol crys- 
tals interact with neutrophils to trigger the 
release of NETs, which prime macrophages 
to produce a precursor form of the inflam- 
matory cytokine interleukin-1f (pro-IL-1B). 
In addition to activating NETosis, choles- 
terol crystals bind to the cell surface pro- 
tein CD36 on macrophages, are internalized 
via endocytosis (4), and activate a signaling 
complex called the inflammasome (5). The 
inflammasome promotes maturation of 
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IL-1B via cleavage of pro-IL-18 with cas- 
pase-1. Thus armed, macrophages present 
in atherosclerotic plaques carry out inflam- 
matory and plaque-destabilizing functions. 
Ultimately, these processes induce throm- 
botic complications that interrupt arterial 
blood supply to downstream tissues, caus- 
ing myocardial infarction and stroke (6). 
Warnatsch et al. identified NETs accru- 
ing adjacent to cholesterol crystals and 
macrophages in atherosclerotic lesions of 
apolipoprotein E-deficient (Apoe-) mice. 
In this model of atherosclerosis, the ab- 
sence of ApoE impairs reverse cholesterol 
transport, resulting in hypercholesterol- 
emia, leukocytosis, and plaque formation 
in the vessel wall. Incubation of human 
neutrophils with cholesterol crystals in 
vitro triggered NETosis—that is, neutro- 
phils died while expelling nuclear material, 
which includes the well-known danger sig- 
nal DNA. To determine whether NETosis 
contributes to atherogenesis, the authors 
engineered Apoe mice that also lack neu- 
trophil elastase and proteinase-3 (Apoe-/ 


Cholesterol crystals 


Macrophage 


oe 


Propagation of 
inflammation 


Ela2‘-/Prin3"-), two serine proteases that 
are secreted during inflammation to de- 
stroy pathogens and host tissue, and local- 
ize to NETs. These triple mutant mice could 
not support NETosis. When consuming a 
diet high in fat and cholesterol, Apoe’/ 
Ela2'-/Prtn3'- mice developed dramati- 
cally smaller atherosclerotic lesions com- 
pared to Apoe’ control animals, despite 
similar lipid concentrations and leukocyte 
counts in blood. In the triple mutant mice, 
the authors found no NETs, lower systemic 
IL-IB concentration, and fewer lesional 
T cells that produce IL-17, a cytokine that 
propagates inflammation by promoting the 
expression of multiple inflammatory cyto- 
kines (although its role in atherosclerosis 
is not firmly established). Moreover, DNAse 
treatment of Apoe’- mice resulted in the 
same small lesion phenotype as seen in the 
Apoe'/Ela2‘-/Prtn3- mice, further sup- 
porting DNA’ atherosclerosis-aggravating 
role. Warnatsch et al. also noted that incu- 
bation of human monocytes first with NETs 
and then with cholesterol crystals resulted 
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Fueling inflammation. Cholesterol crystals and NETosis collaborate to activate the inflammasome in 


murine atherosclerosis. 
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in strong inflammasome activation. The 
observations suggest a two-step inflamma- 
some activation process (see the first figure) 
in murine plaque macrophages in which 
NETs prime a macrophage for inflamma- 
some activation and pro-IL-18 production. 
Cholesterol crystals then induce lysosomal 
damage, which is sensed by the macro- 
phage inflammasome called NOD-like re- 
ceptor family, pyrin domain containing 3 


Deploying NETs. Colored scanning electron micrograph of neutrophil 
extracellular traps (NETs, brown) capturing spores from the yeast 
Candida albicans (yellow). 


(NLRP3). NLRP3 activates caspase-1, which 
subsequently leads to proteolytic cleavage 
and release of IL-1B (5). 

Cholesterol crystal-induced NETosis 
as a macrophage inflammasome stimu- 
lator adds to the growing knowledge on 
neutrophil activity in atherosclerosis, 
including the role of neutrophils in aug- 
menting monocyte recruitment (7) and 
the pro-thrombotic actions of NETs in the 
setting of plaque erosion (8). Do these in- 
teractions occur in human atherosclerotic 
plaque? Although Warnatsch et al. provide 
compelling human in vitro data, and NETs 
are present on the surface of eroding hu- 
man plaque (8), it will be important to 
verify that the observed murine phenotype 
translates to patients. Additionally, it is 
unknown if similar interactions between 
neutrophils and macrophages also occur in 
other organs, as these cells travel far and 
hypercholesterolemia is systemic. Hence, 
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it will be imperative to determine whether 
the phenotype observed in Apoe’-/Ela2‘/ 
Prtn3 mice is exclusive to NETosis; other 
leukocytes, including macrophages, also 
express proteinase-3 (9). The high turn- 
over of plaque macrophages (J0) and the 
observation of NETosis in diabetes (11), a 
common risk factor for atherosclerosis, 
mean the striking phenotype reported 
by Warnatsch et al. could depend on ad- 
ditional mechanisms that still 
await discovery. 

Warnatsch et al. introduce 
an interesting new facet to our 
understanding of how neutro- 
phils and macrophages commu- 
nicate. Macrophage activation 
through NETosis contrasts with 
neutrophil-macrophage interac- 
tion during resolution of inflam- 
mation. When macrophages 
engulf apoptotic neutrophils, a 
noninflammatory macrophage 
phenotype supports healing 
and a return to a steady state 
(1). Thus, the presence of cho- 
lesterol crystals dramatically 
influences cell-cell interactions 
and macrophage function. In 
considering possible therapies, 
inhibiting NETosis, or at least 
eliminating extracellular DNA, 
may prove beneficial. An alter- 
native approach could reduce 
neutrophils by limiting their 
production in the bone marrow, 
migration, and/or recruitment. 
Both strategies would edit the 
instructions that macrophages 
receive from their tissue micro- 
environment, possibly curbing 
inflammasome activation and 
promoting inflammation resolution. Further 
downstream in the pathway, inhibiting the 
active form of IL-18 with a neutralizing anti- 
body is a promising strategy currently being 
tested in patients with atherosclerosis (12). ™ 
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GLOBAL NUTRITION 


Metrics for 
land-scarce 
agriculture 


Nutrient content must 
be better integrated into 
planning 


By Ruth DeFries,’* Jessica Fanzo,” 
Roseline Remans,** Cheryl Palm,’ 
Stephen Wood,” Tal L. Anderman’® 


ver the past half-century, the para- 

digm for agricultural development 

has been to maximize yields through 
intensifying production, particularly 

for cereal crops (J). Increasing pro- 
duction of high-yielding cereals— 
wheat, rice, and maize—has replaced more 
nutrient-rich cereals, which has eroded the 
content of essential dietary nutrients in 
the world’s cereal supply. New approaches 
are needed to produce healthy 

POLICY foods, rich in essential nutri- 
ents, with efficient use of land. 

Standard yield metrics that measure the 
quantity of production are inadequate to as- 
sess progress toward this goal; thus, we pro- 
pose alternative metrics of nutritional yields. 

Intensification of agriculture through 
multiple crops per year, high-yielding seed 
varieties, synthetic fertilizer, mechaniza- 
tion, and other inputs increases yields and 
improves the efficiency of using land to pro- 
duce food. In the last 50 years, intensifica- 
tion of cereal production has increased the 
world’s cereal supply by a factor of almost 
2.2, outpacing the 1.3-fold increase in popu- 
lation growth (2). Intensification was critical 
for averting food shortages that loomed on 
the horizon in the 1960s. 

Intensification has spared 18 to 27 mil- 
lion hectares that would have been required 
to produce the same amount of cereals 
with yields equivalent to those in the mid- 
1960s (1). But intensification can exacerbate 
land-clearing in the absence of appropriate 
policies and enforcement (3). Moreover, in- 
tensification relies on high inputs of energy, 
fertilizer, pesticides, and water (4). Environ- 
mental consequences include runoff of excess 
fertilizer that damages water quality, toxicity 
from pesticides, nitrous oxide emissions, and 
degradation of habitat for biodiversity. 

With increasing competition for land 
(e.g., for carbon storage, watershed protec- 
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the proportion of maize diverted for feed 
remained fairly stable over the time period, 
the change in the mix of cereal types had an 
even larger effect on reducing the nutritional 
content of the global, directly consumed ce- 
real supply than the diversion to feed (table 
S1 and fig. S4). 

In other words, the amount of cereals that 
a person would need to consume to fulfill the 
daily dietary reference intake (DRI) has in- 
creased for protein, iron, and zinc, based on a 
mix of cereals in proportion to the production 
of each type. In 1961, 533, 821, and 735 g of ce- 
reals were needed to satisfy requirements for 
protein, iron, and zinc, respectively. By 2011, 
the amount required increased to 556, 1013, 


and 777 g. Grams required to satisfy energy 
requirements remained nearly unchanged at 
625 to 623 g over this time period. The nutri- 
ent-to-calorie ratio in our directly consumed 
cereal supply has declined, with less nutrient- 
dense cereals contributing to high levels of 
micronutrient deficiencies, particularly in 
low-income settings with cereal-based diets. 
Although our analysis did not address bio- 
availability, our estimates of amounts needed 
to meet requirements would be expected 
to increase because of the relatively low 
bioavailability of micronutrients in grains, 
which depends strongly on processing and 
cooking procedures. 

Declining nutritional content of cere- 
als could be compensated by trade and in- 
creased consumption of other foods (15). 
Diversion of cereals for livestock increases 
consumption of animal products and can 
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compensate for eroding nutritional content 
of cereals but requires more land because of 
low conversion efficiencies of animal feed to 
protein (J6). Moreover, low-income popu- 
lations have limited access to substantial 
amounts of animal products. 


NEW METRICS. The standard yield metric 
of agricultural production by weight per unit 
land neglects the importance of human nu- 
tritional needs as a critical factor for sustain- 
able intensification. We propose a metric of 
“nutritional yield,’ the number of adults who 
would be able to obtain 100% of their recom- 
mended DRI of different nutrients for 1 year 
from a food item produced annually on one 


Healthy foods are needed. They must be rich in 
essential nutrients produced with efficient use of land. 
Sacks filled with wheat in Punjab, India. 


hectare [see supplementary materials (SM)]. 
Conversely, the inverse of the metric indi- 
cates the land area required to grow enough 
food to supply one person with 100% of rec- 
ommended daily requirement of different 
nutrients from a food item for 1 year. 

In reality, people do not obtain 100% of 
recommended nutrient intake from a single 
food item, and processing and cooking fur- 
ther affect the nutrients available for human 
consumption (74). However, the metric al- 
lows comparison among different crops and 
production systems to evaluate nutritional 
value produced from a given land area. The 
metric could be compiled over different food 
items and along food value chains to measure 
nutritional yield for a food system as a whole. 

In 2013, for example, on average one hect- 
are of rice produced 4.5 metric tons/year, 
which is the equivalent of providing the 
annual energy requirement for 19.9 adults. 
Millet produced only 0.9 metric tons/ha per 
year, the annual energy requirement for 4.0 
adults. However, a hectare of rice fulfills the 
annual iron requirement for only 7.6 adults, 
compared with 15.3 for millet. Similarly, oats 
yield more zinc per ha than all other cereals 
except maize, despite providing fewer metric 
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tons and energy per hectare than rice, wheat, 
and maize (see the graph, B). In India and 
LDCs, drought-tolerant millet and sorghum 
provide nutritional yields for iron higher 
than rice, despite relatively low yields in 
terms of metric tons per hectare. In China, 
wheat and rice have the lowest nutritional 
yields of all cereals for iron (fig. S6). 

Decisions about the desired mix of crops 
to achieve efficient use of land differ accord- 
ing to the priority: quantity of production or 
quality to produce essential dietary nutri- 
ents. The nutritional yield metric can guide 
decisions to simultaneously address both 
priorities at multiple scales. At the field and 
farm scales, the metric can quantify benefits 
of improved, nutrition-sensitive agricultural 
practices, such as the use of biofortified va- 
rieties and integrated soil fertility manage- 
ment. At a landscape or national scale, the 
metric can be applied to formulate policies 
that promote a mix of crops that balance 
productivity in terms of quantity and nutri- 
tional needs of the respective populations. At 
the global scale, the metric could be included 
in projections of food demand and require- 
ments for micronutrients and other macro- 
nutrients beyond energy. 

With growing pressures on land resources, 
food systems will be called upon to use land 
efficiently. At the same time, scarce land re- 
sources need to provide adequate nutrition 
for the world’s population and alleviate mi- 
cronutrient deficiencies. This confluence of 
imperatives calls for new alliances, metrics, 
and analyses for incorporating human nutri- 
tion as a primary consideration for sustain- 
able agriculture. ™ 
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SUPRAMOLECULAR CHEMISTRY 


Living supramolecular polymerization 


Greater control is achieved over the chain growth and properties of dynamic materials 


By Rahul Dev Mukhopadhyay’? 
and Ayyappanpillai Ajayaghosh!” 


ost polymers that we encounter, like 
those used in grocery bags and soda 
bottles, are formed from small mol- 
ecules (monomers) held together by 
strong covalent bonds. In supramo- 
lecular polymerization, monomers 
bond through weaker reversible interactions, 
such as hydrogen bonds (H-bonds). Supra- 
molecular polymerization usually proceeds 
through step-growth mechanisms (J), where 
both ends of each monomer are reactive and 
many smaller oligomers form before long 
polymers appear. To overcome this prob- 
lem, a method developed for covalent poly- 
mers known as living polymerization has 
been adopted for supramolecular polymers 
to achieve better control and uniformity of 


chain growth and dispersity. B Amide group Cc 
Living polymerization is a type of chain- Porphyrin 
growth polymerization in which monomers Alkyl chains 
undergo polymerization only upon reacting —m, Monomer 
with an initiator to generate an active center. oe reservoir 
The active site regenerates with each mono- 
mer addition; it propagates along the polymer : -_ 1 
strand before transferring the active center Ivono6in hot solvent) Jase 
to another polymer strand or terminating via 
mutual coupling. In fact, the last two steps of Step 1 Step 3 + 
chain transfer or termination are essentially Cooling Heating 


removed in a living polymerization process. 
The last monomer unit on a polymeric strand 
remains active until deliberately terminated, | Step-growth 
so adding more monomer—or a different pathway 
monomer—resumes the reaction. Properties 
like the degree of polymerization (number of 
monomers in the chain), the chain confor- 


Pre-assembled 
mene seed 


Be 


Pp 1 age(aliquot) 


mation, and its lifetime (of the propagating 1) 4¢¢¢Nanoparticle) 

chain) can therefore be efficiently controlled lie Step 5 

if a chain-growth polymerization is realized Sica d A few hours 
ep 


in such dynamic supramolecular systems (2). 
An early attempt on controlled supramo- 
lecular polymerization was made by Manners, Cooperative 


Winnick, and co-workers (3) by assembling pathway oe Seeded 
polyferrocenyldimethylsilane block copoly- polymerization 
mers in hydrocarbon solvents. Addition of a | 
fresh feed of the polymer in a good solvent _ Step 4 
1 


Several days 


like tetrahydrofuran caused the nanosized Heating 
cylindrical micelle seeds to grow up to mi- 
crometers in length. Very recently, they have 


H-agg 


1 age6Manofiber) 


Formation of micelles and nanofibers. (A) Laser scanning confocal microscopy image (scale bar, 5 um) of self- 


‘Chemical Sciences and Technology Division, CSIR-National assembled polyferrocenyldimethylsilane block copolymer micelles functionalized with fluorescence tunable BODIPY 
Institute for Interdisciplinary Science and Technology (CSIR- di 5 ati : : ‘ei ; 
NIIST), Trivancirurm 695019, india, “Academy of Scientific and (boron-dipyrromethene) derivatives. Molecular structures responsible for different emission colors are represented, 
Innovative Research (AcSIR), CSIR-NIIST, Trivandrum 695019, n-Bu, n-butyl (6). (B) Supramolecular seeded growth of nanofibers from molecule 1 proceeds slowly after first forming 
India. E-mail: ajayaghosh@niist.res.in nanoparticles, but when these nanoparticles interact with existing nanofibers (C), the process is much faster (7). 
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prepared hierarchical [one-dimensional (1D) 
or 3D] multiblock comicelle structures by 
addition of different block copolymers to the 
preformed micellar seeds, which were stable 
both in solution and in solid state (4). “Liv- 
ing crystallization’-driven block copolymer 
self-assembly was also used to prepare func- 
tionalized block copolymers (see the first 
figure, panel A) with segments of different 
emission colors (5, 6). 

Later, Sugiyasu, Takeuchi, and co-workers 
showed that systems undergoing self-assem- 
bly following a nucleation-elongation mech- 
anism coupled with a kinetically controlled 
pre-equilibrium process results in living su- 
pramolecular polymers (7). These authors 
observed that a porphyrin-based molecule 
initially forms that “wrong product,” the for- 


nanofibers of the organogelator as seeds. 
Although the above reports have taken us 
a few steps closer to the concept of living su- 
pramolecular polymerization, unimolecular 
control on chain growth remained elusive 
until recently. Miyajima, Aida, and co-work- 
ers (9) reported an interesting property of 
a nonplanar bowl-shaped corannulene mol- 
ecule appended with five amide-functional- 
ized thio alkyl chains (see the second figure, 
panel A). This C,-symmetric molecule did 
not undergo self-assembly via intermolecu- 
lar H-bonding in methylcyclohexane (MCH) 
because the formation of intramolecular 
H-bonds between the five amide chains is 
more conformationally feasible. However, 
upon heating, this metastable cagelike 
monomer opened up to undergo a sponta- 


A Living polymerization without termination: I--M--M--M--M--M--M--M--M--M--M--M--M 


H-bond acceptor 


H-bond donor 


Ss a 
% Sh “Monomer (M) 


Initiator (1) 


B_ Concept of optical resolution 


mation of so-called “J-aggregates” that lead to 
metastable organic nanoparticles. After sev- 
eral days, these nanoparticles transformed 
into stable “H-aggregates” with nanofiber- 
like morphology via a cooperative mecha- 
nism (see the first figure, panel B). When an 
aliquot of the nanofibers was added to a so- 
lution of the nanoparticles, which serve as a 
reservoir for the porphyrin monomers, nano- 
fibers formed much more rapidly (within a 
few hours) through a living polymerization 
process (see the first figure, panel C). 
Recently, a clever molecular design by 
Wiirthner and co-workers (8) allowed a pro- 
longed lag time in the self-assembly process 
of a perylene bisimide-based organogela- 
tor by locking the molecule in an inactive 
conformation via induced intramolecular 
H-bonding. A living polymerization process 
was achieved by introducing preassembled 
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neous 1D self-assembly. Once such assembly 
formed, it continued to grow with a fresh 
supply of the metastable monomer even at 
low temperatures (9). 

Further work by Miyajima, Aida, and 
co-workers used corannulene molecules 
functionalized with methyl-substituted 
amides as the initiator of the chain-growth 
polymerization because these molecules re- 
main in an open conformation in the ab- 
sence of intramolecular H-bonds and serve 
as suitable proton acceptors when a caged 
corannulene molecule approaches (10). Po- 
lymerization was initiated only from one 
face of the monomer and promoted growth 
only in a particular direction. The resulting 
polymers were robust enough to remain 
stable even at a 10-fold dilution, so it was 
possible to analyze them by size exclusion 
chromatography. 
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When chiral initiators were used, po- 
lymerization occurred only when the con- 
figuration of the stereogenic center of the 
monomer matched with that of the initiator. 
The monomer (M) with achiral side chains 
consisted of an asymmetric center at the 
corannulene core that formed a racemic 
mixture through a bowl-to-bowl inversion 
process. The chiral initiators (I) could dif- 
ferentiate between the enantiomers of M 
and undergo polymerization with selective 
handedness. This property was used to op- 
tically resolve a racemic mixture of mono- 
mers M, and M, by using either I, or I, as 
an initiator (see the second figure, panel B). 

Supramolecular living polymerization 
is a step closer to the precision synthesis 
of complex architectures and presents us 


Unimolecular control of chain 
growth. (A) Unimolecular control 

in a supramolecular living chain- 
growth polymerization with initiator 
and monomer M. (B). Schematic 
representation of optical resolution 
process is shown. The different M and | 
derivatives are represented below; Me, 
methy! (10). [Figures reproduced with 
permission from (6), (7), and (10)] 


with a broad canvass to work toward new- 
generation functional materials. Controlled 
living polymerization in systems like triaryl- 
amines (77) or donor-acceptor molecules re- 
mains challenging with respect to the design 
of organic electronic devices with optimum 
light-conversion efficiency. ™ 
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ARTIFICIAL INTELLIGENCE 


Virtual love 


A young programmer falls 
for a humanoid robot, but is 
the feeling mutual? 


By Rosalind W. Picard 


n the opening scene of Ex Machina, 
Caleb Smith, a tall, young programmer, 
has just won a contest to spend a week 
at the home of his company’s founder, 
Nathan Bateman. Bateman is the genius 
programmer behind Bluebook, a cor- 
poration handling 94% of the world’s web 
search requests. On arriving at the isolated 
retreat, Caleb learns that he will be the first 
person to see if Nathan’s newest creation, the 
robot Ava, passes the Turing test. The clas- 
sic test, developed by Alan Turing, has the 
computer hidden from the human, who asks 
questions, receives answers via text, and in 
the end must judge, “Is this a human or a 
computer?” When Caleb points out that this 
will be different—that he will know that it’s 
a computer from the start—Nathan argues 
that it will be all the more impressive if Ca- 
leb knows that Ava is a computer and yet be- 
comes convinced that she is conscious. 
When Caleb meets Ava, we see close ups 
of Ava’s face and shots framing her sculpted 
breasts, transparent waistline, and inter- 
nally illuminated organs. Caleb responds as 
any healthy 26-year-old heterosexual male 
might, with a slack jaw. When Nathan asks 
Caleb, “How do you feel about her?” Caleb is 
effusive: “She’s fucking awesome.” The more 
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interesting question, however, is what Nathan 
asks next, “How does she feel about you?” 

As Caleb and Ava supposedly grow closer, 
we watch for the emotions to develop. When 
Caleb shares that his parents died when he 
was 15, Ava’s face remains unmoved; her 
pause is only perfunctory before changing the 
topic. Later, we see experiences that should 
elicit expressions of pain and do not. Her face 
reminds me of the face of a dermatologist I 
once met who had experimented a bit too ea- 
gerly with Botox. Scientists have shown that 
we read a neutral facial expression as happy 
if we felt happy before we saw it or as slightly 
sad if we felt slightly sad before we saw it. We 
may buy that Ava has feelings because she 
produces words that describe feelings and be- 
cause Nathan said “she can feel pleasure,” but 
we do not see her demonstrate convincing 
empathy or emotional intelligence (7). 

At one point, Caleb recognizes that Ava 
has “mind-reading” abilities. He does not 
mean the “I know all your thoughts” kind 
of mind reading that is routinely debunked 
but the ability to infer the likely mental state 
of another person—a simple task for most 
people. For example, if your phone is placed 
under a basket while you watch, and then 
moved while you are out of the room, when 
you come back to retrieve it, a “mind reader” 
would expect you to look for it where it had 
been and for you to be surprised to find that 
it is not where you left it. Ava speaks lines 
that imply that she is aware of minds and 
can reason about them. Some scientists argue 
that a lack of mind reading is the hallmark of 
autism, although there is also ample counter- 
evidence to this. As such, Nathan’s remark 
that autistic people are not aware of their 
own minds, or those of others, is the first line 
I would have put on the cutting room floor. 

When Caleb tries to engage Nathan intel- 
lectually about how Ava works, keywords like 
“stochastic” and “linearized” are sprinkled 
into the conversation in a way that is in- 
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In Ex Machina, director 
Alex Garland explores f 
the implications of 


Ex Machina 

Alex Garland, director 
DNA Films, 2015. 

108 minutes. 


tended to sound erudite without boring the 
uninitiated. To those of us who build emo- 
tional artificial intelligence (AI), however, the 
phrases have the sound of a toddler randomly 
sampling keys on a Steinway grand piano. 
Nonetheless, the director, Alex Garland, 
has done some homework on the science of 
bringing emotions, emotional intelligence, 
and more to AI. He is wise, for example, to 
stretch the Turing test over multiple days 
(2). He is also clearly aware of how valuable 
it is to collect massive data to train the sys- 
tem. At one chilling point, Nathan tells Caleb 
how Ava was taught to read and synthesize 
facial expressions by turning on every camera 
and microphone in every cell phone on the 
planet and recording everybody without their 
knowledge. Still, the film leaps over the major 
breakthroughs that would be required before 
we could encounter such a future, namely, 
the complete lack of evidence that machines 
could ever have conscious feelings like ours. 
A particularly glaring gap is the fact that 
Ava is programmed without morality. Some 
might consider morality an option or an 
upgrade; however, smart machines do not 
evolve unguided. A machine’s choices are 
significantly biased by the procedures with 
which it has been programmed. Ava, and her 
actions, say more about the mind of her pro- 
grammer than about the robot he created. 
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PUBLIC HEALTH 


Food for 
thought 


A beginner’s guide 
to ethical eating 


By Nicholas Freudenberg 


hould I get my morning coffee at 
Dunkin’ Donuts, where it is cheap but 
where it will be served by workers 
who are not paid a living wage (7)? Or 
should I go to Starbucks, where it will 
cost much more but workers get better 
pay and benefits (2)? When I shop for dinner 
tonight, should I buy organic, free range, or 
natural beef—or skip meat altogether to sup- 
port the rights of cows and the well-being of 
our planet? And when Monsanto urges me to 
reject a legislative proposal to require labels 
on genetically modified food, should I accept 
their argument that labeling will make food 
more expensive—especially for the poor? Or 
should I instead listen to my environmen- 
talist friends who assert that we should not 
expose humans and the planet to modified 
organisms whose long-term health and envi- 
ronmental impacts are unknown? 

These are among the questions Paul B. 
Thompson examines in From Field to Fork: 
Food Ethics for Everyone. His goal is not to 
tell us what to eat but to apply the methods 
of philosophy and ethics to the choices in- 
dividuals and societies have to make about 
food. He hopes to help readers find the right 
balance between “enabling choices vs. im- 
posing values on others.” His refreshing be- 
lief is that public dialogue and debate on the 
values and ethics of key food questions will 
enable policy-makers and the public to make 
more informed and equitable choices. 

Thompson is a philosopher at Michigan 
State University, where he holds the W. K. 
Kellogg Chair in Agricultural, Food, and 
Community Ethics. His 30 years’ experi- 
ence at the crossroads of philosophy, food, 
and agriculture make him a knowledgeable 
guide to food ethics, a field he helped to cre- 
ate. This book considers a variety of topics 
that have attracted media and public atten- 
tion: hunger and food insecurity in a land 
of plenty; the impact of the Green Revolu- 
tion and genetically modified crops on food 
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production, famine, and the environment; 
the ethical, health, and environmental ratio- 
nales for vegetarianism; and the human cost 
of cheap food (workers paid so little they 
must depend on public benefits to survive). 

Thompson’s deep knowledge gives him 
some unexpected insights. Although he 
doubts that existing evidence on genetically 
modified organisms (GMOs) warrants re- 
strictions for health or environmental rea- 
sons and accepts the contention that they 
can help make food more available to the 
world’s poor, he argues that their widespread 
use endangers traditional forms of agrarian- 
ism. The ancient Greeks, Thomas Jefferson, 
and current agroecologists have all claimed 
that small farmers are the foundation for 
democracy and equality. By patenting seeds 
and dominating global markets, a few mul- 
tinational companies have wrested decisions 
about how to farm from small growers and 
thus diminished their power to shape rural 
economies and politics. Thompson argues 
that these and other doubts about GMOs 
warrant continued skepticism about ex- 
panding their use. 

Thompson recognizes that how we grow, 
sell, and eat food cannot be separated from 
our economic and political systems. “Since 
everyone needs to eat,’ he writes, “con- 
trol of food is a powerful locus for both 
profit-taking and the exercise of social con- 
trol.” However, the book does not directly 
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From Field to Fork examines how our food choices 
affect society and the planet. 


consider some of the most urgent ethical 
questions confronting our food system. For 
example, are the business models of PepsiCo 
and McDonald’s ethical when they depend 
on persuading individuals to consume high- 
sugar, -fat, and -calorie products that are 
associated with costly diet-related diseases? 
And is New York City imposing a “nanny 
state” or simply living up to its mandate to 
protect public health when it seeks policies 
designed to restrict portion sizes or add 
warning labels to unhealthy foods? By not 
addressing such questions, Thompson leaves 
two key players in our food system—global 
food companies and government—curiously 
absent from his analyses. Because changing 
the practices of these two food behemoths is 
the goal of the global food justice movement, 
a deeper analysis of the ethical frameworks 
each uses could have helped to advance the 
search for sustainable and moral solutions to 
global food problems. 

Despite this gap, From Field to Fork is a 
sensible and engaging introduction to food 
ethics. Thompson avoids moralizing and 
rejects the tendency of some scientists to 
assume that evidence alone can settle food 
policy fights without engaging with the 
moral arguments of critics. By emphasizing 
that our food system ultimately reflects our 
values, Thompson sets the table for more 
satisfying discussions of food policy. 
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Holocene as 
Anthropocene 


IN THEIR PERSPECTIVE “Defining the 
epoch we live in” (3 April, p. 38), W. F. 
Ruddiman et al. write that in spite of its 
popularity, the Anthropocene still lacks an 
official onset. They propose that the term 
anthropocene be used informally (without 
the initial capital), which would avoid 
the constraints of a formal designation. 
We disagree. 

The human footprint on the planet is 
so distinctive and lasting that the Age 
of Man must be officially recognized 
in the geologic time scale. We support 
considering the Anthropocene and the 
Holocene as a single geologic time span. 
This solution has already been pro- 
posed (7) but was inexplicably ignored. 
Combining the two epochs would provide 
the Anthropocene with a climatic jus- 
tification (2). It was the end of the last 
glacial period that allowed the increase 
of the human population and its role as 
a geologic force, and anthropic impacts 
on the planet took place during most of 
the Holocene. Moreover, the latest studies 
are placing the appearance of domes- 
ticated plants and animals—which can 
plausibly be considered the onset of the 
Anthropocene (3)—increasingly earlier in 
time, closer and closer to the 11.7 ky B.P. 
Pleistocene-Holocene boundary (4). 

Even if the impact of humans was not 
immediately major and uniform across 
the planet, what are a few thousand 
years of discrepancy between the end of 
the last glacial period and the dawn of 
the Anthropocene in comparison to the 
entire geological scale? Most transitions 
between the formally defined epochs 
would be much longer than the Holocene. 
Anthropocene seems a more reasonable 
name than Holocene for this combined 
time span, whose most characteristic 
trait is the human pressure on the planet. 
Holocene could possibly be the first stage 
of the Anthropocene, the one character- 
ized by a soft and spotty human impact 
on Earth. 
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Geological evidence 
for the Anthropocene 


DEFINITIONS ARE ONE of the bedrocks 
of science. However, W. F. Ruddiman 
et al. (“Defining the epoch we live in,” 
Perspectives, 3 April, p. 38) propose that 
the geological term Anthropocene should 
remain deliberately undefined and ambigu- 
ous. They recommend not formalizing the 
term because some inception dates miss 
important earlier human-induced environ- 
mental impacts, particularly widespread 
farming. We have sympathy for this view. 
However, defining the Anthropocene 
as a geological epoch must be based on 
evidence, including changes to the Earth 
system lasting millions of years and the 
existence of stratigraphic evidence mark- 
ing such changes (1). All other geological 
time-units have agreed-upon dated markers 
or agreed-upon dates. The Anthropocene 
should not be treated differently. 
Ruddiman e¢ al. discuss a recent paper 
suggesting that the Anthropocene began 
in 1945 (2). Zalasiewicz et al. (2) note, as 
have others (3), that many stratigraphic 
changes are coeval with the mid-to-late 
20th century. However, they then side-step 
geological convention by not specifying a 
Global Stratotype Section and Point (GSSP) 
to define the Anthropocene. Instead, they 
choose the date of the first nuclear weapon 
detonation (2), which created no strati- 
graphic evidence in 1945. Given that GSSPs 
are the preferred method of marking 
geological time units (1), 1945 is unlikely 
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oe 


to obtain the backing of the multiple com- 
mittees of geologists necessary to formally 
define the Anthropocene epoch (4). 

The appropriate GSSP marker relating 
to nuclear weapons is the globally measur- 
able peak in radionuclide fallout, 
in 1964, after the Partial Test Ban Treaty 
(3). An earlier date also likely adheres 
to the geological criteria for defining an 
epoch: 1610. This date marks the irrevers- 
ible exchange of species following the 
collision of the Old and New worlds; an 
associated unusual drop in atmospheric 
CO, captured in Antarctic ice cores 
provides the GSSP marker (3). This date 
implicitly includes the impacts of 
farming, as the drop in CO, largely 
resulted from vegetation regrowth on 
abandoned farmlands following the 
deaths of 50 million indigenous Americans 
(mostly from smallpox brought by 
Europeans). The annexing of the Americas 
by Europe was also an essential precursor 
to the Industrial Revolution and therefore 
captures associated later waves of envi- 
ronmental change (3). Choosing among 
alternative inception dates will be chal- 
lenging (5). 

Finally, Ruddiman and colleagues sug- 
gest using the term Anthropocene with a 
small “a” to denote its informality. This is 
not consistent with geological-community 
norms. Formal time-unit names have a 
capital letter: Anthropocene Epoch versus 
Anthropocene epoch for the informal term 
(, 3). An elegant solution to referring to 
time before an agreed-upon Anthropocene 
Epoch is already available: the paleoan- 
thropocene (6). Geological norms should 
be followed when considering defining the 
Anthropocene. 

Simon L. Lewis’? and 
Mark A. Maslin’ 
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Response 


CERTINI AND SCALENGHE and Lewis 

and Maslin seem convinced that geo- 
science must rely heavily on formal 
stratigraphic nomenclature to move 
forward. This view likely originates from 
the centuries-long effort to compile the 
relative geologic age sequence, which was 
a monumental achievement, especially 
given religious and social opposition. The 
compilation required careful attention to 
stratigraphic principles such as superposi- 
tion, index fossils, diachroneity, and facies 
changes. Over time, the findings became 
formalized in geologic nomenclature. 

However, since the mid-1900s, geochem- 
ical (radiometric) dating has increasingly 
overshadowed (although not replaced) 
those time-honored principles. In my 
experience, most geoscience papers in 
recent decades initially refer to geologic 
nomenclature as a way of providing a 
familiar frame of reference, but they soon 
shift focus to the best geochemical dating 
available. Although my view will displease 
some geological colleagues, isn’t it time 
to acknowledge that almost all dating of 
geological sequences now takes place in 
the “geochemical age” (with a small “a”)? 

Lewis and Maslin propose that we 
designate a “Paleoanthropocene’” (capital 
“P”). Should we place its start at the mass 
extinction of marsupials in Australia near 
50,000 years ago? Or should we choose the 
mass extinction of large mammals in the 
Americas almost 40,000 years later? Or, if 
we decide to pick a level during the 10,000 
years or more when crops and livestock 
were being domesticated and agriculture 
was spreading across the continents, 
which one time within that long span 
should we use for a formal designation? 
Could any choice be satisfying, given 
that it would have to ignore the rest of 
the long and rich history of early human 
influences? 

In contrast, informal use of these terms 
(anthropogenic, anthropocene, and paleo- 
anthropocene) would present no such 
problems or constraints. 

William F. Ruddiman 
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TECHNICAL COMMENT 
ABSTRACTS 


Comment on “Sedimentary DNA from 
a submerged site reveals wheat in the 
British Isles 8000 years ago” 

K. D. Bennett 

Smith et al. (Reports, 27 February 2015, p. 
998) identify wheat DNA from an 
8000-calendar-years-before-the-present 
archaeological site in southern England and 
conclude that wheat was traded to Britain 
2000 years before the arrival of agriculture. 
The DNA samples are not dated, either 
directly or from circumstantial evidence, so 
there is no chronological evidence to support 
the claim. 

Full text at http://dx.doi.org/10.1126/science. 
aab1886 


Response to Comment on “Sedimentary 
DNA from a submerged site reveals 
wheat in the British Isles 8000 years ago” 
Oliver Smith, Garry Momber, Richard Bates, 
Paul Garwood, Simon Fitch, Mark Pallen, 
Vincent Gaffney, Robin G. Allaby 

Bennett questions the rigor of the dating of 
our sample from which sedimentary ancient 
DNA was obtained and the reliability of 

the taxonomic identification of wheat. We 
present a further radiocarbon date from 
$308 that confirms the lateral consistency 
of the palaeosol age. The suggestion of 
taxonomic false positives in our data 
illustrates a misinterpretation of the 
phylogenetic intersection analysis. 

Full text at http://dx.doi.org/10.1126/science. 
aab2062 


ERRATA 


Erratum for the Report “Mutation rate 
and genotype variation of Ebola virus from 
Mali case sequences” by T. Hoenen ef al., 
Science 348, aac5674 (2015). Published online 
22 May 2015; 10.1126/science.aac5674 


Erratum for the Report: “Molten uranium 
dioxide structure and dynamics” by L. B. 
Skinner et al., Science 348, aab3869 (2015). 
Published online 1 May 2015; 10.1126/science. 
aab3869 


Correction for the Report: “The in vivo 
dynamics of antigenic variation in 
Trypanosoma brucei” by M. R. Mugnier 
et al., Science 347, aaa4502 (2015). Published 
online 22 April 2015; 10.1126/science.aaa4502. 
Fig. 2A contained errors in the print version 
of Science. The correct figure is displayed 
online. 


Erratum for the Research Article: “Neu- 
rotransmitter switching in the adult brain 
regulates behavior” by D. Dulcis et al., 
Science 348, aab2338 (2015). Published online 
10 April 2015; 10.1126/science.aab2338 
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TECHNICAL COMMENT 


ARCHAEOLOGY 


Comment on “Sedimentary DNA from 
a submerged site reveals wheat in 
the British Isles 8000 years ago” 


K. D. Bennett” 


Smith et al. (Reports, 27 February 2015, p. 998) identify wheat DNA from an 8000-calendar- 
years-before-the-present archaeological site in southern England and conclude that 
wheat was traded to Britain 2000 years before the arrival of agriculture. The DNA samples 
are not dated, either directly or from circumstantial evidence, so there is no chronological 


evidence to support the claim. 


urrent understanding of the spread of ag- 
riculture across Europe indicates arrival in 
Britain about 6000 years ago, having taken 
about 2000 years to spread across central 
Europe and the English Channel from the 
Mediterranean (7). There is abundant evidence 
for the occurrence of early forms of wheat in Britain 
after 6000 calendar years before the present (cal 
yr B.P.), so the main claim of Smith et al’s inves- 
tigations at Bouldnor Cliff southern England, is 
the early date of about 8000 cal yr B.P. (2), rather 
than the discovery of wheat itself. 
The samples examined for DNA (2) come from 
a monolith, S308, collected “from a location at the 
site” [p. 999 in (2)], or “in the proximity MS-08 
and MS-05, which were adjacent to each other” 
[p. 4 in the supplementary materials (SM) for (2)]. 
Nowhere in the Report is there any mention of a 
measurement between S308 and other monoliths, 
either horizontally or vertically. It is not clear how 
much time elapsed between the collection of the 
dated monoliths and S308. All the radiocarbon 
dating was carried out on monoliths other than 
$308 [table S1 in the SM for (2)]. No information 
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is provided on other analyses that would provide 
a firm link between the DNA analyses of S308 
and any sample of the dated monoliths. Sedi- 
mentary analyses were carried out on the other 
monoliths (3) and might easily have been applied 
to S308 to provide a link even if, for some reason, 
it was not possible to obtain a radiocarbon date 
directly from the sediment matrix (within which 
the wheat DNA was obtained) of S308. 

The stratigraphic section for Bouldnor Cliff 
[figure 1B in (2)] shows a complex series of coast- 
al sediments, dipping into a channel. This raises 
the possibility that samples taken later, even if 
at the same measured location (horizontal and 
vertical) as the originals, might have come from 
a stratigraphically different part of the sequence, 
depending on accuracy and precision of both 
horizontal and vertical measurements, as well as 
allowance for erosion between the dates of sam- 
ple collection. There are also inconsistencies in 
the presentation of the stratigraphy, including the 
relative extents of monoliths MS-04 and MS-07 
[separated by a gap in figure 1B in (2) but con- 
tiguous in figure S2 in the SM for (2)]. Reference 
is made to a paleosol, which might incorporate 
material from a wide range of ages. 

The only evidence for wheat at the site comes 
from DNA. There are no wheat macrofossils or 
wheat pollen in the samples. The contention is 
that this indicates that the wheat did not come 
from nearby agriculture (2). However, whenever 


the wheat was incorporated in the sediment 
(from nearby agriculture or trading), it must have 
been associated with macrofossil remains, such 
as wheat grains. Both hazel and alder macrofos- 
sils (but no DNA) were found in or near S308 
(2), so it is curious that no trace of wheat re- 
mains have been found. There are inconsisten- 
cies in the DNA—even after the phylogenetic 
intersection analysis (PIA)—including the pres- 
ence of DNA from animals known in the mid- 
Holocene only from regions far removed from 
southern Britain, such as Ursus maritimus (polar 
bear, Arctic) and Cervus nippon (Sika deer, East 
Asia), and from tropical grasses (Panicoideae). 
These suggest that some false positives are coming 
through PIA or that the taxonomic precision in- 
dicated is inappropriate for the content of the 
database, which might be relevant to the age or 
origin of the wheat identification. 

Given that S308 contains DNA of wheat, how 
might it have arrived there? There are a number 
of possibilities. The samples may be contempo- 
raneous with the monoliths (2), so the wheat may 
have been traded from Europe, presumably all the 
way from the Mediterranean at 8000 cal yr B.P. 
(D). Alternatively, if the samples are actually much 
younger, the wheat may have been incorporated 
from nearby agriculture, perhaps from eroded 
soil, and ended up near to early Holocene sam- 
ples through the vagaries of sedimentation in a 
complex channel and coastal situation. The lat- 
ter is the more parsimonious view, in the absence 
of evidence to the contrary. Finally, the identifica- 
tion of the wheat DNA may be another false posi- 
tive in the comparisons with database DNA. 

The claim that wheat was being traded to 
Britain at 8000 cal yr B.P. has substantial impli- 
cations for understanding of the archaeology of 
northwest Europe in the early postglacial pe- 
riod. Such a claim can be readily substantiated by 
dating the sediment matrix of the samples direct- 
ly. A claim based on undated samples, lacking 
any incontrovertible link to dated samples, is 
insufficient to overturn current understanding. 
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Response to Comment on 
“Sedimentary DNA from a submerged 
site reveals wheat in the British 

Isles 8000 years ago” 


Oliver Smith,’ Garry Momber,” Richard Bates,’ Paul Garwood,* Simon Fitch,” 
Mark Pallen,®* Vincent Gaffney,’”* Robin G. Allaby’*+ 


Bennett questions the rigor of the dating of our sample from which sedimentary 
ancient DNA was obtained and the reliability of the taxonomic identification of wheat. 
We present a further radiocarbon date from S308 that confirms the lateral consistency of 
the palaeosol age. The suggestion of taxonomic false positives in our data illustrates a 
misinterpretation of the phylogenetic intersection analysis. 


ennett (J) raises the point that possible 

variances in the depositional environment 

make it inappropriate to apply radiocarbon 

dates from nearby monoliths MS-05 and 

MS-08 in our study (2), raising the pos- 
sibility that the wheat DNA we identified could 
possibly be younger than the 8000 years we 
claim. The evidence from the radiocarbon dates 
at the site and those we presented is that the 
palaeosol is of a consistent age over a 420-m area 
reaching between the sites of Bouldnor Cliff II, 
on which our study was based, and Bouldnor 
Cliff V (3). We therefore felt that the site was 
dated securely enough in this study. However, 
we accept that Bennett’s argument is most ro- 
bustly refuted by obtaining a radiocarbon date 
from the sample S308, which is a box sample 
taken from the cliff as outlined in the methods 
and not, as Bennett suggests, a monolith. We 
attempted to date both a twig from the sample 
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and the humic acid fraction of the sediment 
itself. Sediment dating is problematic because 
of the risk of inclusion of older carbon sources 
and can lead to overestimation of age. In this 
case, the humic acid content of the sandy clay was 
too low to provide a direct sediment date, but the 
twig returned an age of 7935 to 7790 calendar 
years before the present (Beta-406961), confirm- 
ing the age of the sample from which the sedi- 
mentary ancient DNA (sedaDNA) was obtained. 

Bennett asserts that the phylogenetic inter- 
section analysis (PIA) fails to filter out exotic 
species such as Ursus maritimus (polar bear), 
Cervus nippon (Sika deer), and tropical pani- 
coids, and as such casts doubt on the validity of 
the identity of the wheat DNA. This is incorrect 
and shows some misunderstanding of the DNA 
analysis. The analysis is shown to be highly ro- 
bust and does not in any way falsely identify 
polar bears, Sika deer, or tropical grasses as 
being present at the site. These are instances of 
closest match between sedaDNA reads and the 
database; it would be naive to interpret these 
as a species identification, for reasons we ex- 
plained in some depth in the supplementary 
materials of (2). These are not instances of reads 
that have been filtered out by the PIA, but rather 
meet the criteria of the analysis. The PIA is pred- 
icated on the fact that, due to variable data- 
base representation, the species of origin may 
not be present in the database; indeed, often, 
large taxonomic orders are represented by only 
a very few species for many genomic regions. 
The robustness of the analysis comes from 


examining the phylogenetic range of similar DNAs 
within a database. In fact, Bennett is alluding to 
sedaDNA that is attributed robustly to Carnivora, 
Caniformia, or Ursidae, in the case of the “polar 
bear,” meaning that the true species of origin 
lies somewhere in the phylogenetic range en- 
compassed by this order, suborder, or family, 
respectively. Similarly, cervinae is identified in 
the case of the “Sika deer,” and various uncon- 
troversial higher taxonomic orders of grasses are 
identified in the analysis in which the closest data- 
base match is from a tropical grass such as rice. 

Factors such as genome size and how well 
genomes are represented in databases have a 
great influence on the resulting frequency in 
the DNA profile. Wheat has a large and well- 
characterized genome, and, furthermore, many 
wheat species and the sister species Aegilops 
have been characterized. This means that we 
have a great power to detect wheat relative to 
other organisms from metagenomic profiles 
using the PIA, which is reflected in the relatively 
large number of wheat sequences we identify. 
Given the high level of accuracy (81%) of the 
analysis, the evidence for the presence of wheat 
at Bouldnor is overwhelming. 

Finally, Bennett asserts that sedaDNA re- 
quires the existence of macrofossil sources of 
DNA. The sandy clay from which we took the 
sedaDNA sample was largely devoid of macro- 
fossils save for a few twigs, with a low organic 
content, despite the large number of taxonomic 
orders that were identified. Bouldnor Cliff has 
not been extensively excavated yet, and it is pos- 
sible that the quantities of grain involved may 
be very small and that such wheat macrofossils 
remain to be discovered. However, the presence 
of sedaDNA in the absence of macrofossils is 
precedented (4). We can make no conclusion 
about the source of the DNA without further 
evidence. However, if rapid transport occurred— 
such as might be expected, for instance, using 
boats associated with pioneer groups on the 
western coast of France—then the source may 
even have been flour rather than grain. 
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RISE OF THE 


SPECIAL SECTION 


MACHINES 


By Jelena Stajic, Richard Stone, Gilbert Chin, and Brad Wible 


Ithough most would agree that the average person is smarter 
than the average cat, comparing humans and machines is 
not as straightforward. A computer may not excel at abstract 
reasoning, but it can process vast amounts of data in the blink 
of an eye. In recent years, researchers in artificial intelligence 
(AI) have used this computational firepower on the scads of data 
accumulating online, in academic research, in financial records, 
and in virtually all walks of life. The algorithms they develop 
help machines learn from data and apply that knowledge in 
new situations, much like humans do. The ability of computers to extract 
personal information from seemingly innocuous data raises privacy 
concerns. Yet many AI systems indisputably improve our lives; for 
example, by making communication easier through machine translation, 
by helping diagnose illness, and by providing modern comforts, such as 
your smartphone acting as your personal assistant. This special issue 
presents a survey of the remarkable progress made in AI and outlines 
challenges lying ahead. 

Many AI systems are designed for narrow applications, such as playing 
chess, flying a jet, or trading stocks. AI researchers also have a grander 
aspiration: to create a well-rounded and thus more humanlike intelligent 
agent. Scaling that research peak is daunting. But triumphs in the field 
of AI are bringing to the fore questions that, until recently, seemed better 
left to science fiction than to science: How will we ensure that the rise of 
the machines is entirely under human control? And what will the world 
be like if truly intelligent computers come to coexist with humankind? 


The editors gratefully acknowledge the advice of Eric Horvitz (Microsoft) on the 
Reviews in this special issue. 
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Ellie, a virtual health agent, 
monitors patients’ expression: 
gestures, and voi 


The synthetic therapist 


Some people prefer to bare their souls to computers 
rather than to fellow humans 


eople have always noticed Yrsa 
Sverrisdottir. First it was ballet, 

which she performed intensively 

while growing up in Iceland. Then 

it was science, which she excelled 

at and which brought her to the 

stage at conferences. And starting 

in 2010, when she moved to the 
University of Oxford in the United 
Kingdom to study the neurophysiology of the 
heart, it was her appearance. With her Nor- 
dic features framed by radiant blonde hair, “I 
just stand out here,” she says. “I can’t help it.” 
After she arrived in the United Kingdom, 
she found that she no longer enjoyed the 
attention. She began to feel uncomfortable 
in crowds. Her relationships suffered. There 
had been some obvious stressors, such as 
the deaths of both of her parents. But the 
unease didn’t let up. By 2012, she says, “I felt 
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like I was losing control.” Then she met Fjola 
Helgadottir, one of the few other Icelanders 
in town. Helgadottir, a clinical psychology 
researcher at Oxford, had created a com- 
puter program to help people identify and 
manage psychological problems on their 
own. Sverrisdottir decided to give it a try. 
The program, based on a technique 
called cognitive behavioral therapy (CBT) 
and dubbed CBTpsych, begins with sev- 
eral days of interactive questioning. “It 
was exhausting,” Sverrisdottir says. The 
interrogation started easily enough with 
basic personal details, but then began to 
probe more deeply. Months of back and 
forth followed as the program forced her 
to examine her anxieties and to identify 
distressing thoughts. CBTpsych diagnosed 
her as having social anxiety, and the in- 
sight rang true. Deep down, Sverrisdottir 
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realized, “I didn’t want people to see me.” 

Then the program assumed the role of 
full-fledged therapist, guiding her through 
a regimen of real-world exercises for tak- 
ing control. It sounds like a typical success 
story for clinical psychology. But no human 
psychologist was involved. 

CBTpsych is far from the only computer- 
ized psychotherapy tool available, nor the 
most sophisticated. Ellie, a system built at 
the University of Southern California (USC) 
in Los Angeles, uses artificial intelligence 
(AD) and virtual reality to break down barri- 
ers between computers and humans. Origi- 
nally funded by the U.S. military, its focus 
is on diagnosing and treating psychological 
trauma. Because patients interact with a 
digital system, the project is generating a 
rare trove of data about psychotherapy itself. 
The aim, says Albert “Skip” Rizzo, the USC 
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psychologist who leads the effort, is nothing 
short of “dragging clinical psychology kick- 
ing and screaming into the 21st century.” 

A19 June editorial in The New York Times 
deemed computerized psychotherapy “ef- 
fective against an astonishing variety of 
disorders.” The penetration of the Internet 
into far-flung communities could also bring 
mental health treatment to vast numbers of 
people who otherwise have no access. 

But whether clinical psychologists will 
accept AI into their practice is uncertain. 
Nor is it clear that the tools of AI can carry 
computerized psychotherapy beyond its so 
far limited capacity, says Selmer Bringsjord, 
a cognitive scientist and AI researcher at 
Rensselaer Polytechnic Institute in Troy, New 
York. “It is incredibly ambitious.” 


ALL OF TODAY’S VIRTUAL psychologists 
trace their origins to ELIZA, a computer pro- 
gram created half a century ago. Named after 
the young woman in Pygmalion who rapidly 
acquires sophisticated language, ELIZA was 
nothing more than a few thousand lines of 
code written by Joseph Weizenbaum and 
other computer scientists at the Massachu- 
setts Institute of Technology 
(MIT) in the early 1960s to study 
human-computer interaction. 

ELIZA followed rules that de- 
termined how to respond during 
a dialogue. The most convinc- 
ing results came from a rule set 
called DOCTOR that simulated a 
psychotherapist: By turning patients’ state- 
ments around as questions, the program 
coaxed them to do most of the talking. For 
instance, in response to a patient saying, “I 
feel helpless,’ the computer might respond, 
“Why do you think you feel that way?” (You 
can talk to ELIZA yourself at http://psych.ful- 
lerton.edu/mbirnbaum/psych101/Eliza.htm.) 

People engaged readily with ELIZA, 
perhaps more for its novelty than its con- 
versational skills, but AI researchers were 
unimpressed. “The idea that you could make 
a convincing AI system that didn’t really 
have any intelligence was seen as cheating,” 
says Terry Winograd, a computer scientist at 
Stanford University in Palo Alto, California, 
who was a Ph.D. student down the hall from 
Weizenbaum. This was a wildly optimistic 
time for the field, with many researchers an- 
ticipating computers with human-level gen- 
eral intelligence right around the corner. 

But work on artificial general intelligence 
didn’t pan out, and funding and interest 
dried up in what has come to be known as 
the “AI winter.” It wasn’t until the turn of the 
new millennium that mainstream interest in 
AI resurged, driven by advances in “narrow 
AI,’ focusing on specific problems such as 
voice recognition and machine vision. 
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Conversational “chatbots” such as ELIZA 
are still viewed as a parlor trick by most com- 
puter scientists (Science, 9 January, p. 116). 
But the chatbots are finding a new niche in 
clinical psychology. Their success may hinge 
on the very thing that AI researchers eschew: 
the ability of an unintelligent computer to 
trick people into believing that they are talk- 
ing to an intelligent, empathetic person. 


THAT ISN’T EASY, as Rizzo is keenly aware. 
What most often breaks the spell for a pa- 
tient conversing with Ellie isn’t the content 
of the conversation, because the computer 
hews closely to a script that Rizzo’s team 
based on traditional clinical therapy ses- 
sions. “The problem is entrainment,’ he 
says, referring to the way that humans sub- 
consciously track and mirror each other’s 
emotions during a conversation. 

For example, a patient might say to 
Ellie, “Today was not the best day,’ but the 
voice recognition software misses the 
“not.” So Ellie smiles and exclaims, “That’s 
great!” For an AI system striving to bond 
with a human patient and earn trust, Rizzo 
says, “that’s a disaster.” 


The goal is “dragging clinical psychology 


kicking and screaming into the 2Ist century.” 
Albert “Skip” Rizzo, University of Southern California 


To improve entrainment, a camera tracks a 
patient’s psychological signals: facial expres- 
sion, posture, hand movement, and voice 
dynamics. Ellie crunches those data in an at- 
tempt to gauge emotional state. 

The patterns can be subtle, says Louis- 
Philippe Morency, a computer scientist at 
USC who has led the development of the AI 
that underlies Ellie. For instance, he says, 
a person’s voice may shift “from breathy 
to tense.” The team devised algorithms to 
match patterns to a likely emotional state. 
It’s imperfect, he says, but “our experiments 
showed strong correlation with [a patient’s] 
psychological distress level.” 

Other patterns unfold over multiple ses- 
sions. For instance, the team’s work with 
US. veterans suffering from post-traumatic 
stress disorder (PTSD) revealed that “smile 
dynamics” are a strong predictor of depres- 
sion. The pattern is so subtle that it took 
a computer to detect it: Smiling frequency 
remained the same in depressed patients, 
on average, but the duration and intensity 
of their smiles was reduced. 

Even if Ellie were to achieve perfect en- 
trainment, Rizzo says, it “is really just an en- 
hanced ELIZA.” The AI under the hood can 
only sustain about a 20-minute conversation 
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before the spell breaks, which limits the sys- 
tem’s usefulness for diagnosis and treatment 
of most psychological problems. Without 
sophisticated natural language processing 
and semantic knowledge, Ellie will never 
fool people into believing that they are talk- 
ing to a human. But that’s okay, Rizzo says: 
Becoming too humanlike might backfire. 
One counterintuitive finding from Rizzo’s lab 
came from telling some patients that Ellie is 
a puppet controlled by a human while telling 
others she is fully autonomous. The patients 
told there was a puppeteer were less engaged 
and less willing to open up during therapy. 

That’s no surprise to AI researchers 
like Winograd. “This goes right back to 
ELIZA,” he says. “If you don’t feel judged, 
you open up.” 

Ethical and privacy issues may loom if AI 
therapy goes mainstream. Winograd worries 
that online services may not be forthcoming 
about whether there is a human in the loop. 
“There is a place for deceiving people for 
their own good, such as using placebos in 
medicine,’ he says. But when it comes to 
AI psychology, “you have to make it clear to 
people that they are talking to a machine 
and not a human.” 

If patients readily open up 
to a machine, will clinicians be 
needed at all? Rizzo is adamant 
that a human must always be 
involved because machines can- 
not genuinely empathize with 
patients. And Ellie, he points 
out, has a long way to go before being ready 
for prime time: The program does not yet 
have the ability to learn from individual pa- 
tients. Rizzo envisions AI systems as a way 
to gather baseline data, providing psycholo- 
gists with the equivalent of a standard bat- 
tery of blood tests. “The goal isn’t to replace 
people,” he says, “but to create tools for hu- 
man caregivers.” 

Helgadottir has a bolder vision. Although 
computers are not going to replace thera- 
pists anytime soon, she says, “I do believe 
that in some circumstances computerized 
therapy can be successful with no human 
intervention ... in many ways people are not 
well suited to be therapists.” A computer 
may be more probing and objective. 

Sverrisdottir’s experience suggests that 
CBTpsych, at least, can make a difference. 
Under the program’s tutelage, she says, 
“very slowly, I started to analyze myself 
when I’m amongst other people.” She identi- 
fied a pattern of “negative thoughts about 
people judging me.” 

She might have got there with a human 
therapist, she says. But in the years since she 
first started talking to a computer about the 
trouble swirling in her mind, Sverrisdotter 
says, “I have been able to change it.” m 
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Fears of an Al pioneer 


Stuart Russell argues that AI is as dangerous 
as nuclear WEdpoOns By John Bohannon 


rom the enraged robots in the 1920 

play R.U.R. to the homicidal computer 

H.A.L. in 2001: A Space Odyssey, sci- 

ence fiction writers have embraced the 

dark side of artificial intelligence (AI) 

ever since the concept entered our col- 
lective imagination. Sluggish progress in AI 
research, especially during the “AI winter” 
of the 1970s and 1980s, made such worries 
seem far-fetched. But recent breakthroughs 
in machine learning and vast improvements 
in computational power have brought a flood 
of research funding— 
and fresh concerns 
about where AI may 
lead us. 

One researcher now 
speaking up is Stuart 
Russell, a computer 
scientist at the Uni- 
versity of California, 
Berkeley, who with 
Peter Norvig, director 
of research at Google, wrote the premier AI 
textbook, Artificial Intelligence: A Modern 
Approach, now in its third edition. Last year, 
Russell joined the Centre for the Study of Ex- 
istential Risk at Cambridge University in the 
United Kingdom as an AI expert focusing on 
“risks that could lead to human extinction.” 
Among his chief concerns, which he aired at 
an April meeting in Geneva, Switzerland, run 
by the United Nations, is the danger of put- 
ting military drones and weaponry under the 


- "| 


252 17 JULY 2015 + VOL 349 ISSUE 6245 


full control of AI systems. This interview has 
been edited for clarity and brevity. 


Q: What do you see as a likely 

path from Al to disaster? 

A: The basic scenario is explicit or implicit 
value misalignment: AI systems [that are] 
given objectives that don’t take into ac- 
count all the elements that humans care 
about. The routes could be varied and 
complex—corporations seeking a super- 
technological advantage, countries trying to 
build [AI systems] before their enemies, or 
a slow-boiled frog kind of evolution leading 
to dependency and enfeeblement not un- 
like E. M. Forster’s The Machine Stops. 


Q: You've grappled with this 

issue for a long time. 

A: My textbook has a section “What If We 
Do Succeed?” devoted to the question of 
whether human-level AI or superintelligent 
systems would be a good idea. [More recent 
causes for concern are] the rapid develop- 
ments in AI capabilities such as the legged 
locomotion of Big Dog [the autonomous 
robot created by Boston Dynamics, recently 
acquired by Google] and progress in com- 
puter vision. 


Q: What needs to be done to 

prevent an Al catastrophe? 

A: First, research into the precise nature 
of the potential risk and development of 
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technical approaches to eliminate the risk. 
Second, modification of the goals of AI and 
the training of students so that alignment 
of AI systems with human objectives is 
central to the field, just as containment is 
central to the goals of fusion research. 


Q: But by the time we were developing 
nuclear fusion, we already had atomic 
bombs, whereas the Al threat seems 
speculative. Are we really at the “fusion” 
stage of Al research? 

A: Here’s what Leo Szilard wrote in 1939 
after demonstrating a [nuclear] chain 
reaction: “We switched everything off and 
went home. That night, there was very 
little doubt in my mind that the world 
was headed for grief.” To those who say, 
well, we may never get to human-level or 
superintelligent AI, I would reply: It’s like 
driving straight toward a cliff and saying, 
“Let’s hope I run out of gas soon!” 


Q: The intention with fission was to create 
a weapon. The intention with Al is to create 

a tool: intelligence on tap. Does that explain 
the reluctance to regulate Al? 

A: From the beginning, the primary inter- 
est in nuclear technology was the “inex- 
haustible supply of energy.” The possibility 
of weapons was also obvious. I think there 
is a reasonable analogy between unlimited 
amounts of energy and unlimited amounts 
of intelligence. Both seem wonderful until 
one thinks of the possible risks. In neither 
case will anyone regulate the mathemat- 
ics. The regulation of nuclear weapons 
deals with objects and materials, whereas 
with AI it will be a bewildering variety of 
software that we cannot yet describe. I’m 
not aware of any large movement calling 
for regulation either inside or outside AI, 
because we don’t know how to write such 
regulation. 


Q: Should we start tracking Al research 

as we track fissile material? Who should 
do the policing? 

A: I think the right approach is to build 
the issue directly into how practitioners 
define what they do. No one in civil 
engineering talks about “building bridges 
that don’t fall down.” They just call it 
“building bridges.” Essentially all fusion 
researchers work on containment as 

a matter of course; uncontained fusion 
reactions just aren’t useful. Right now 
we have to say “AI that is probably 
beneficial,’ but eventually that will just 
be called “AI.” [We must] redirect the 
field away from its current goal of building 
pure intelligence for its own sake, regard- 
less of the associated objectives and their 
consequences. & 
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POLICY FORUM 


Data, privacy, and the greater good 


Eric Horvitz'* and Deirdre Mulligan?* 


Large-scale aggregate analyses of anonymized data can yield valuable results and insights that 
address public health challenges and provide new avenues for scientific discovery. These methods 
can extend our knowledge and provide new tools for enhancing health and wellbeing. However, they 
raise questions about how to best address potential threats to privacy while reaping benefits for 
individuals and to society as a whole. The use of machine learning to make leaps across informational 
and social contexts to infer health conditions and risks from nonmedical data provides 
representative scenarios for reflections on directions with balancing innovation and regulation. 


hat if analyzing Twitter tweets or Face- 

book posts could identify new mothers 

at risk for postpartum depression (PPD)? 

Despite PPD’s serious consequences, early 

identification and prevention remain 
difficult. Absent a history of depression, detection 
is largely dependent on new mothers’ self-reports. 
But researchers found that shifts in sets of activ- 
ities and language usage on Facebook are pre- 
dictors of PPD (J) (see the photo). This is but one 
example of promising research that uses machine 
learning to derive and leverage health-related in- 
ferences from the massive flows of data about in- 
dividuals and populations generated through social 
media and other digital data streams. At the same 
time, machine learning presents new challenges 
for protecting individual privacy and ensuring fair 
use of data. We need to strike a new balance be- 
tween controls on collecting information and con- 
trols on how it is used, as well as pursue auditable 
and accountable technologies and systems that 
facilitate greater use-based privacy protections. 

Researchers have coined terms, such as digi- 
tal disease detection (2) and infodemiology (3), 
to define the new science of harnessing diverse 
streams of digital information to inform public 
health and policy, e.g., earlier identification of epi- 
demics, (4) modeling communicability and flow of 
illness (5), and stratifying individuals at risk for 
illness (6). This new form of health research can 
also inform and extend understandings drawn from 
traditional health records and human subjects re- 
search. For example, the detection of adverse drug 
reactions could be improved by jointly leveraging 
data from the U.S. Food and Drug Administration’s 
Adverse Event Reporting System and anonymized 
search logs (7). Search logs can serve as a large- 
scale sensing system that can be used for drug 
safety surveillance—pharmacovigilance. 
Infodemiology studies are typically large-scale 

aggregate analyses of anonymized data—publicly 
disclosed or privately held—that yield results and 
insights on public health questions across popula- 
tions. However, some methods and models can be 
aimed at making inferences about unique individ- 
uals that could drive actions, such as alerting or 
providing digital nudges, to improve individual 
or public health outcomes. 


lMicrosoft Research, Redmond, WA 98052, USA. “University 
of California, Berkeley, Berkeley, CA 94720, USA. 
*Corresponding author. E-mail: horvitz2@microsoft.com (E.H.); 
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Although digital nudging shows promise, a re- 
cent flare-up in the United Kingdom highlights 
the privacy concerns it can ignite. A Twitter suicide- 
prevention application called Good Samaritan 
monitored individuals’ tweets for words and 
phrases indicating a potential mental health crisis. 
The app notified the person’s followers so they 


Machine learning can make “category-jumping” infer- 
ences about health. New mother's activities and language 
usage on social media are predictors of postpartum 
depression. 


could intervene to avert a potential suicide. But 
the app was shuttered after public outcry drew reg- 
ulator concern (8). Critics worried the app would 
encourage online stalkers and bullies to target vul- 
nerable individuals and collected 1200 signatures 
on a petition arguing that the app breached users’ 
privacy by collecting, processing, and sharing sen- 
sitive information. Despite the developers’ laudable 
goal of preventing suicide, the nonprofit was chas- 
tised for playing fast and loose with the privacy and 
mental health of those it was seeking to save (9). 
Machine learning can facilitate leaps across infor- 
mational and social contexts, making “category- 
jumping” inferences about health conditions or 
propensities from nonmedical data generated far 
outside the medical context. The implications for 
privacy are profound. Category-jumping inferences 
may reveal attributes or conditions an individual has 
specifically withheld from others. To protect against 
such violations, the United States heavily regulates 
health care privacy. But, although information about 
health conditions garnered from health care treat- 
ment and payment must be handled in a manner 
that respects patient privacy, machine learning and in- 
ference can sidestep many of the existing protections. 


Even when not category-jumping, machine learning 
can be used to draw powerful and compromising in- 
ferences from self-disclosed, seemingly benign data or 
readily observed behavior. These inferences can un- 
dermine a basic goal of many privacy laws—to allow 
individuals to control who knows what about them. 
Machine learning and inference makes it increasingly 
difficult for individuals to understand what others can 
know about them based on what they have explicitly 
or implicitly shared. And these computer-generated 
channels of information about health conditions 
join other technically created fissures in existing legal 
protections for health privacy. In particular, it is dif- 
ficult to reliably deidentify publicly shared data sets, 
given the enormous amount and variety of ancillary 
data that can be used to reidentify individuals. 

The capacities of machine learning expose the fun- 
damental limitations of existing U.S. privacy rules that 
tie the privacy protection of an individual's health sta- 
tus to specific contexts or specific types of informa- 
tion a priori identified as health information. Health 
privacy regulations and privacy laws in the United 
States generally are based on the assumption 
that the semantics of data are relatively fixed 
and knowable and reside in isolated contexts. 
Machine learning techniques can instead be used 
to infer new meaning within and across contexts 
and is generally unencumbered by privacy rules 
in the United States. Using publicly available 
Twitter posts to infer risk of PPD, for example, 
does not run afoul of existing privacy law. This 
might be unsurprising, and seem unproblematic, 
given that the posts were publicly shared, but 
there are troubling consequences. 

Current privacy laws often do double duty. At a 
basic level, they limit who has access to infor- 
mation about a person. This implicitly limits the 
extent to which that information influences 
decision-making and thus doubles as a limit on 
the opportunities for information to fuel dis- 
crimination. Because of the heightened privacy 
sensitivities and concerns with health-related dis- 
crimination, we have additional laws that regulate 
the use of health information outside the health care 
context. U.S. laws specifically limit the use of some 
health information in ways considered unfair. For 
example, credit-reporting agencies are generally pro- 
hibited from providing medical information to make 
decisions about employment, credit, or housing. The 
Americans with Disabilities Act (ADA) prohibits dis- 
crimination on the basis of substantial physical or 
mental disabilities—or even a mistaken belief that an 
individual suffers from such a disability. If machine 
learning is used to infer that an individual suffers 
from a physical or mental impairment, an employer 
who bases a hiring decision on it, even if the in- 
ference is wrong, would violate the law. 

But the ADA does not prohibit discrimination based 
on predispositions for such disabilities (10). Machine 
learning might discover those, too. In theory, the 
Genetic Information Non-Discrimination Act (GINA) 
should fill this gap by protecting people genetically 
predisposed to a disease. But again, machine learn- 
ing exposes cracks in this protection. Although 
GINA prohibits discrimination based on informa- 
tion derived from genetic tests or a family history of 
a disease (1D), it does not limit the use of information 
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about such a disposition—even if it is grounded in 
genetics—inferred through machine learning tech- 
niques that mine other sorts of data. In other words, 
machine learning that predicts future health status 
from nongenetic information—including health status 
changes due to genetic predisposition—would cir- 
cumvent existing legal protections (2). 

Just as machine learning can expose secrets, it 
facilitates social sorting—placing individuals into 
categories for differential treatment—with good or 
bad intent and positive or negative outcomes. The 
methods used to classify individuals as part of ben- 
eficial public health programs and nudges can just 
as easily be used for more nefarious purposes, such 
as discrimination to protect organizational profits. 

Policy-makers in the United States and else- 
where are just beginning to address the challenges 
that machine learning and inference pose to com- 
mitments to privacy and equal treatment. Although 
not specifically focused on health information, 
reports issued by the White House—discuss the 
potential for large-scale data analyses to result in 
discrimination (13)—and the Federal Trade Com- 
mission (FTC) have suggested new efforts to pro- 
tect privacy, regulate harmful uses of information, 
and increase transparency. 

The FTC is the key agency policing unfair and 
deceptive practices in the commercial marketplace, 
including those that touch on the privacy and se- 
curity of personal information. Its proposed privacy 
framework encourages companies to combine tech- 
nical and policy mechanisms to protect against re- 
identification. The FTC’s proposed rules would 
work to ensure that data are both “not reasonably 
identifiable” and accompanied by public company 
commitments not to reidentify it. The same pri- 
vacy rules should apply to downstream users of the 
data (14). This approach is promising for machine 
learning and other areas of artificial intelligence 
that rely on data-centric analyses. It allows learning 
from large data sets—and sharing them—by encour- 
aging companies to reduce the risks that data pools 
and data sharing pose for individual privacy. 

The FTC proposal grows, in part, from recent agen- 
cy actions focused on inferences that we have deemed 
“context-jumping.” In one high-profile case, Netflix 
publicly released data sets to support a competition 
to improve their recommendation algorithm. When 
outside researchers used ancillary data to reidentify 
and infer sensitive attributes about individuals from 
the Netflix data sets, the FTC worked with the com- 
pany to limit future public disclosures—setting out 
the limits discussed above. In a similar vein, the FTC 
objected to a change in Facebook's defaults that ex- 
posed individuals’ group affiliations from which 
sensitive information, such as political views and 
sexual orientation, could be inferred (75). 

Additionally, the FTC has made efforts to ensure 
that individuals can control tracking in the online 
and mobile environments. These are in part due to 
the nonobvious inferences that can be drawn from 
vast collections of data (J6-18) and the subsequent 
risks to consumers, who may be placed in classif- 
ications that single them out for specific treatment 
in the marketplace (19, 20). In a related context, 
the FTC recommended that Congress require data 
brokers—companies that collect consumers’ personal 


254 17 JULY 2015 + VOL 349 ISSUE 6245 


information and resell or share that information 
with others—to clearly disclose to consumers in- 
formation about the data they collect, as well as 
the fact that they derive inferences from it (27). 
Here, too, the FTC appears concerned with not just 
the raw data, but inferences from its analysis. 

The Obama Administration’s Big Data Initiative 
has also considered the risks to privacy posed by 
machine learning and the potential downsides of 
using machine inferences in the commercial mar- 
ketplace (22, 23), concluding that we need to up- 
date our privacy rules, increase technical expertise 
in consumer protection and civil rights agencies to 
address novel discrimination issues arising from 
big data, provide individuals with privacy preserv- 
ing tools that allow the to control the collection and 
manage the use of personal information, as well as 
increase transparency into how companies use and 
trade data. The Administration is also concerned 
with the use of machine learning in policing and 
national security. The White House report called for 
increased technical expertise to help civil rights and 
consumer protection agencies identify, investigate, 
and resolve uses of big data analytics that have a 
discriminatory impact on protected classes (24). 

Note that reports and proposals from the Ad- 
ministration distinctly emphasize policies and reg- 
ulations focused on data use rather than collection. 
While acknowledging the need for tools that allow 
consumers to control when and how their data is 
collected, the Administration recommendations fo- 
cus on empowering individuals to participate in 
decisions about future uses and disclosures of col- 
lected data (25). A separate report by the President's 
Council of Advisors on Science and Technology 
(PCAST) concluded that this was a more fruitful di- 
rection for technical protections. Both reports sug- 
gest that use-based protections better address the 
latent meaning of data—inferences drawn from 
data using machine learning—and can adapt to the 
scale of the data-rich and connected environment of 
the future (26). The Administration called for col- 
laborative efforts to ensure that regulations in the 
health context will allow society to reap the benefits 
and mitigate the risks posed by machine learning 
and inferences. Use-based approaches are often fav- 
ored by industry, as well, which tends to view data 
as akin to a natural resource to be mined for com- 
mercial and public benefit, and industry is resistant 
to efforts to constrain data collection. 

Although incomplete and unlikely to be acted 
upon by the current gridlocked Congress, adop- 
tion of these recommendations would increase 
transparency about data’s collection, use, and con- 
sequences. Along with efforts to identify and con- 
strain discriminatory or unfair uses of data and 
inferences, they are promising steps. They also align 
with aspects of existing European privacy laws 
concerned with the transparency and fairness of 
data processing, particularly the risks to indi- 
viduals of purely automated decision-making. 

Current European Union (EU) law requires en- 
tities to provide individuals with access to the data 
on which decisions are rendered, as well as infor- 
mation about decision criteria [see Articles 12 and 
15 of (2). Although currently governed by a Europe- 
wide directive, both provisions are a matter of na- 


tional law. What exactly individuals receive when 
they request access to their data and to processing 
logic varies by country, as does the implementation 
of the limitation on “purely automated” processing. 
The EU is expected to adopt a data privacy regula- 
tion that will supplant local law, with a single na- 
tional standard. Although the current draft includes 
parallel provisions, their final form is not yet known 
nor is how they will ultimately be interpreted (27). 
In theory, a new EU requirement to disclose the 
logic of processing could apply quite broadly, with 
implications for public access to data analytics and 
algorithms. In the interim, a decision expected this 
summer in a case before the European Court of 
Justice may provide some detail as to what level 
of access to both data and the logic of processing is 
currently required under the EU Directive (28). 

Improving the transparency of data processing 
to data subjects is both important and challenging. 
Although the goal may be to promote actual under- 
standing of the workings or likely outputs of ma- 
chine learning and reasoning methods, the workflows 
and dynamism of algorithms and decision criteria 
may be difficult to characterize and explain. For ex- 
ample, popular convolutional neural-network learn- 
ing procedures (commonly referred to as “deep 
learning”) automatically induce rich, multilayered 
representations that their developers themselves 
may not understand with clarity. Although high- 
level descriptions of procedures and representations 
might be provided, even an accomplished program- 
mer with access to the source code would be un- 
able to describe the precise operation of such a 
system or predict the output of a given set of inputs. 

Data’s meaning has become a moving target. 
Data sets can be easily combined to reidentify data 
sets thought deidentified, and sensitive knowledge 
can be inferred from benign data that are routine- 
ly and promiscuously shared. These pose difficul- 
ties for current U.S. legal approaches to privacy 
protection that regulate data on the basis of its 
identifiability and express meaning. 

Use-based approaches are driven, in part, by the 
realization that focusing solely on limiting data col- 
lection is inadequate. In a way, this presupposes 
that data are an unalloyed good that should be 
collected on principle, whenever and wherever pos- 
sible. Whereas we are not ready to abandon limits on 
data collection, we agree that use-based regulations, 
although challenging to implement, are an impor- 
tant part of the future legal landscape—and will 
help to advance privacy, equality, and the public 
good. To advance transparency and to balance the 
constraints they impose, use-based approaches would 
need to emphasize access, accuracy, and correction 
rights for individuals. 

The evolution of regulations for health informa- 
tion, although incomplete, provides a useful map for 
thinking about the challenges and opportunities we 
face today and frames potential solutions. In health 
care, privacy rules were joined by nondiscrimination 
rules and always were accompanied by special pro- 
visions to support research. Today, they are being 
joined by collective governance models designed 
to encourage pooling of data in biobanks that sup- 
port research on health conditions while protecting 
collective interests in privacy. 
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Despite practical challenges, we are hopeful that 
informed discussions among policy-makers and the 
public about data and the capabilities of machine 
learning, will lead to insightful designs of programs 
and policies that can balance the goals of protecting 
privacy and ensuring fairness with those of reaping 
the benefits to scientific research and to individual 
and public health. Our commitments to privacy and 
fairness are evergreen, but our policy choices must 
adapt to advance them, and support new tech- 
niques for deepening our knowledge. 
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Machine learning: Trends, 
perspectives, and prospects 


M. I. Jordan’ and T. M. Mitchell?* 


Machine learning addresses the question of how to build computers that improve 
automatically through experience. It is one of today’s most rapidly growing technical fields, 
lying at the intersection of computer science and statistics, and at the core of artificial 
intelligence and data science. Recent progress in machine learning has been driven both by 
the development of new learning algorithms and theory and by the ongoing explosion in the 
availability of online data and low-cost computation. The adoption of data-intensive 
machine-learning methods can be found throughout science, technology and commerce, 
leading to more evidence-based decision-making across many walks of life, including 
health care, manufacturing, education, financial modeling, policing, and marketing. 


achine learning is a discipline focused 

on two interrelated questions: How can 

one construct computer systems that auto- 

matically improve through experience? 

and What are the fundamental statistical- 
computational-information-theoretic laws that 
govern all learning systems, including computers, 
humans, and organizations? The study of machine 
learning is important both for addressing these 
fundamental scientific and engineering ques- 
tions and for the highly practical computer soft- 
ware it has produced and fielded across many 
applications. 

Machine learning has progressed dramati- 
cally over the past two decades, from laboratory 
curiosity to a practical technology in widespread 
commercial use. Within artificial intelligence (AI), 
machine learning has emerged as the method 
of choice for developing practical software for 
computer vision, speech recognition, natural lan- 
guage processing, robot control, and other ap- 
plications. Many developers of AI systems now 
recognize that, for many applications, it can be 
far easier to train a system by showing it exam- 
ples of desired input-output behavior than to 
program it manually by anticipating the desired 
response for all possible inputs. The effect of ma- 
chine learning has also been felt broadly across 
computer science and across a range of indus- 
tries concerned with data-intensive issues, such 
as consumer services, the diagnosis of faults in 
complex systems, and the control of logistics 
chains. There has been a similarly broad range of 
effects across empirical sciences, from biology to 
cosmology to social science, as machine-learning 
methods have been developed to analyze high- 
throughput experimental data in novel ways. See 
Fig. 1 for a depiction of some recent areas of ap- 
plication of machine learning. 

A learning problem can be defined as the 
problem of improving some measure of perform- 
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ance when executing some task, through some 
type of training experience. For example, in learn- 
ing to detect credit-card fraud, the task is to as- 
sign a label of “fraud” or “not fraud” to any given 
credit-card transaction. The performance metric 
to be improved might be the accuracy of this 
fraud classifier, and the training experience might 
consist of a collection of historical credit-card 
transactions, each labeled in retrospect as fraud- 
ulent or not. Alternatively, one might define a 
different performance metric that assigns a higher 
penalty when “fraud” is labeled “not fraud” than 
when “not fraud” is incorrectly labeled “fraud.” 
One might also define a different type of training 
experience—for example, by including unlab- 
eled credit-card transactions along with labeled 
examples. 

A diverse array of machine-learning algorithms 
has been developed to cover the wide variety of 
data and problem types exhibited across differ- 
ent machine-learning problems (J, 2). Conceptual- 
ly, machine-learning algorithms can be viewed as 
searching through a large space of candidate 
programs, guided by training experience, to find 
a program that optimizes the performance metric. 
Machine-learning algorithms vary greatly, in part 
by the way in which they represent candidate 
programs (e.g., decision trees, mathematical func- 
tions, and general programming languages) and in 
part by the way in which they search through this 
space of programs (e.g., optimization algorithms 
with well-understood convergence guarantees 
and evolutionary search methods that evaluate 
successive generations of randomly mutated pro- 
grams). Here, we focus on approaches that have 
been particularly successful to date. 

Many algorithms focus on function approxi- 
mation problems, where the task is embodied 
in a function (€.g., given an input transaction, out- 
put a “fraud” or “not fraud” label), and the learn- 
ing problem is to improve the accuracy of that 
function, with experience consisting of a sample 
of known input-output pairs of the function. In 
some cases, the function is represented explicit- 
ly as a parameterized functional form; in other 
cases, the function is implicit and obtained via a 
search process, a factorization, an optimization 
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Despite practical challenges, we are hopeful that 
informed discussions among policy-makers and the 
public about data and the capabilities of machine 
learning, will lead to insightful designs of programs 
and policies that can balance the goals of protecting 
privacy and ensuring fairness with those of reaping 
the benefits to scientific research and to individual 
and public health. Our commitments to privacy and 
fairness are evergreen, but our policy choices must 
adapt to advance them, and support new tech- 
niques for deepening our knowledge. 
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fundamental scientific and engineering ques- 
tions and for the highly practical computer soft- 
ware it has produced and fielded across many 
applications. 

Machine learning has progressed dramati- 
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curiosity to a practical technology in widespread 
commercial use. Within artificial intelligence (AI), 
machine learning has emerged as the method 
of choice for developing practical software for 
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plications. Many developers of AI systems now 
recognize that, for many applications, it can be 
far easier to train a system by showing it exam- 
ples of desired input-output behavior than to 
program it manually by anticipating the desired 
response for all possible inputs. The effect of ma- 
chine learning has also been felt broadly across 
computer science and across a range of indus- 
tries concerned with data-intensive issues, such 
as consumer services, the diagnosis of faults in 
complex systems, and the control of logistics 
chains. There has been a similarly broad range of 
effects across empirical sciences, from biology to 
cosmology to social science, as machine-learning 
methods have been developed to analyze high- 
throughput experimental data in novel ways. See 
Fig. 1 for a depiction of some recent areas of ap- 
plication of machine learning. 
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ance when executing some task, through some 
type of training experience. For example, in learn- 
ing to detect credit-card fraud, the task is to as- 
sign a label of “fraud” or “not fraud” to any given 
credit-card transaction. The performance metric 
to be improved might be the accuracy of this 
fraud classifier, and the training experience might 
consist of a collection of historical credit-card 
transactions, each labeled in retrospect as fraud- 
ulent or not. Alternatively, one might define a 
different performance metric that assigns a higher 
penalty when “fraud” is labeled “not fraud” than 
when “not fraud” is incorrectly labeled “fraud.” 
One might also define a different type of training 
experience—for example, by including unlab- 
eled credit-card transactions along with labeled 
examples. 

A diverse array of machine-learning algorithms 
has been developed to cover the wide variety of 
data and problem types exhibited across differ- 
ent machine-learning problems (J, 2). Conceptual- 
ly, machine-learning algorithms can be viewed as 
searching through a large space of candidate 
programs, guided by training experience, to find 
a program that optimizes the performance metric. 
Machine-learning algorithms vary greatly, in part 
by the way in which they represent candidate 
programs (e.g., decision trees, mathematical func- 
tions, and general programming languages) and in 
part by the way in which they search through this 
space of programs (e.g., optimization algorithms 
with well-understood convergence guarantees 
and evolutionary search methods that evaluate 
successive generations of randomly mutated pro- 
grams). Here, we focus on approaches that have 
been particularly successful to date. 

Many algorithms focus on function approxi- 
mation problems, where the task is embodied 
in a function (€.g., given an input transaction, out- 
put a “fraud” or “not fraud” label), and the learn- 
ing problem is to improve the accuracy of that 
function, with experience consisting of a sample 
of known input-output pairs of the function. In 
some cases, the function is represented explicit- 
ly as a parameterized functional form; in other 
cases, the function is implicit and obtained via a 
search process, a factorization, an optimization 
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procedure, or a simulation-based procedure. Even 
when implicit, the function generally depends 
on parameters or other tunable degrees of free- 
dom, and training corresponds to finding values 
for these parameters that optimize the perform- 
ance metric. 

Whatever the learning algorithm, a key scien- 
tific and practical goal is to theoretically character- 
ize the capabilities of specific learning algorithms 
and the inherent difficulty of any given learning 
problem: How accurately can the algorithm learn 
from a particular type and volume of training 
data? How robust is the algorithm to errors in its 
modeling assumptions or to errors in the train- 
ing data? Given a learning problem with a given 
volume of training data, is it possible to design a 
successful algorithm or is this learning problem 
fundamentally intractable? Such theoretical char- 
| Characters) acterizations of machine-learning algorithms and 

WATE problems typically make use of the familiar frame- 
works of statistical decision theory and compu- 
tational complexity theory. In fact, attempts to 
characterize machine-learning algorithms the- 
oretically have led to blends of statistical and 
computational theory in which the goal is to simul- 
taneously characterize the sample complexity 
(how much data are required to learn accurately) 
and the computational complexity (how much 
computation is required) and to specify how these 
depend on features of the learning algorithm such 
as the representation it uses for what it learns 
(3-6). A specific form of computational analysis 
that has proved particularly useful in recent 
years has been that of optimization theory, 
with upper and lower bounds on rates of con- 
vergence of optimization procedures merging 
well with the formulation of machine-learning 
problems as the optimization of a performance 
: metric (7, 8). 
aa As a field of study, machine learning sits at the 
crossroads of computer science, statistics and a 
variety of other disciplines concerned with auto- 
matic improvement over time, and inference and 
decision-making under uncertainty. Related dis- 
ciplines include the psychological study of human 
learning, the study of evolution, adaptive control 
theory, the study of educational practices, neuro- 
science, organizational behavior, and economics. 
Although the past decade has seen increased cross- 
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Drivers of machine-learning progress 


The past decade has seen rapid growth in the 
ability of networked and mobile computing sys- 
tems to gather and transport vast amounts of 
data, a phenomenon often referred to as “Big 
Data.” The scientists and engineers who collect 
Fig. 1. Applications of machine learning. Machine learning is having a substantial effect on many such data have often turned to machine learn- 
areas of technology and science; examples of recent applied success stories include robotics and | ing for solutions to the problem of obtaining 
autonomous vehicle control (top left), speech processing and natural language processing (top | useful insights, predictions, and decisions from 
right), neuroscience research (middle), and applications in computer vision (bottom). [The middle such data sets. Indeed, the sheer size of the data 
panel is adapted from (29). The images in the bottom panel are from the ImageNet database; object | makes it essential to develop scalable proce- 
recognition annotation is by R. Girshick.] dures that blend computational and statistical 
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considerations, but the issue is more than the 
mere size of modern data sets; it is the granular, 
personalized nature of much of these data. Mo- 
bile devices and embedded computing permit 
large amounts of data to be gathered about in- 
dividual humans, and machine-learning algo- 
rithms can learn from these data to customize 
their services to the needs and circumstances 
of each individual. Moreover, these personalized 
services can be connected, so that an overall ser- 
vice emerges that takes advantage of the wealth 
and diversity of data from many individuals 
while still customizing to the needs and circum- 
stances of each. Instances of this trend toward 
capturing and mining large quantities of data to 
improve services and productivity can be found 
across many fields of commerce, science, and 
government. Historical medical records are used 
to discover which patients will respond best 
to which treatments; historical traffic data are 
used to improve traffic control and reduce con- 
gestion; historical crime data are used to help 
allocate local police to specific locations at spe- 
cific times; and large experimental data sets are 
captured and curated to accelerate progress in 
biology, astronomy, neuroscience, and other data- 
intensive empirical sciences. We appear to be at 
the beginning of a decades-long trend toward in- 
creasingly data-intensive, evidence-based decision- 
making across many aspects of science, commerce, 
and government. 

With the increasing prominence of large-scale 
data in all areas of human endeavor has come a 
wave of new demands on the underlying machine- 
learning algorithms. For example, huge data sets 
require computationally tractable algorithms, high- 
ly personal data raise the need for algorithms 
that minimize privacy effects, and the availabil- 
ity of huge quantities of unlabeled data raises 
the challenge of designing learning algorithms 
to take advantage of it. The next sections survey 
some of the effects of these demands on recent 


work in machine-learning algorithms, theory, and 
practice. 


Core methods and recent progress 


The most widely used machine-learning methods 
are supervised learning methods (1). Supervised 
learning systems, including spam classifiers of 
e-mail, face recognizers over images, and med- 
ical diagnosis systems for patients, all exemplify 
the function approximation problem discussed 
earlier, where the training data take the form of 
a collection of (2, y) pairs and the goal is to 
produce a prediction y* in response to a query 
x*, The inputs x may be classical vectors or they 
may be more complex objects such as documents, 
images, DNA sequences, or graphs. Similarly, 
many different kinds of output zy have been studied. 
Much progress has been made by focusing on 
the simple binary classification problem in which 
y takes on one of two values (for example, “spam” 
or “not spam”), but there has also been abun- 
dant research on problems such as multiclass 
classification (where y takes on one of K labels), 
multilabel classification (where y is labeled simul- 
taneously by several of the K labels), ranking 
problems (where y provides a partial order on 
some set), and general structured prediction 
problems (where y is a combinatorial object such 
as a graph, whose components may be required 
to satisfy some set of constraints). An example 
of the latter problem is part-of-speech tagging, 
where the goal is to simultaneously label every 
word in an input sentence 2 as being a noun, 
verb, or some other part of speech. Supervised 
learning also includes cases in which y has real- 
valued components or a mixture of discrete and 
real-valued components. 

Supervised learning systems generally form 
their predictions via a learned mapping /(2), 
which produces an output y for each input 2 (or 
a probability distribution over y given x). Many 
different forms of mapping f/f exist, including 


decision trees, decision forests, logistic regres- 
sion, support vector machines, neural networks, 
kernel machines, and Bayesian classifiers (1). A 
variety of learning algorithms has been proposed 
to estimate these different types of mappings, and 
there are also generic procedures such as boost- 
ing and multiple kernel learning that combine 
the outputs of multiple learning algorithms. 
Procedures for learning f from data often make 
use of ideas from optimization theory or numer- 
ical analysis, with the specific form of machine- 
learning problems (e.g., that the objective function 
or function to be integrated is often the sum over 
a large number of terms) driving innovations. This 
diversity of learning architectures and algorithms 
reflects the diverse needs of applications, with 
different architectures capturing different kinds 
of mathematical structures, offering different lev- 
els of amenability to post-hoc visualization and 
explanation, and providing varying trade-offs 
between computational complexity, the amount 
of data, and performance. 

One high-impact area of progress in supervised 
learning in recent years involves deep networks, 
which are multilayer networks of threshold units, 
each of which computes some simple param- 
eterized function of its inputs (9, 10). Deep learning 
systems make use of gradient-based optimiza- 
tion algorithms to adjust parameters throughout 
such a multilayered network based on errors at 
its output. Exploiting modern parallel comput- 
ing architectures, such as graphics processing 
units originally developed for video gaming, it 
has been possible to build deep learning sys- 
tems that contain billions of parameters and 
that can be trained on the very large collections 
of images, videos, and speech samples available 
on the Internet. Such large-scale deep learning 
systems have had a major effect in recent years 
in computer vision (11) and speech recognition 
(12), where they have yielded major improve- 
ments in performance over previous approaches 
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Fig. 2. Automatic generation of text captions for images with deep networks. A convolutional neural network is trained to interpret images, and its 
output is then used by a recurrent neural network trained to generate a text caption (top). The sequence at the bottom shows the word-by-word focus of 
the network on different parts of input image while it generates the caption word-by-word. [Adapted with permission from (30)] 
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Fig. 3. Topic models. Topic modeling is a methodology for analyzing documents, where a document is viewed as a collection of words, and the words in 
the document are viewed as being generated by an underlying set of topics (denoted by the colors in the figure). Topics are probability distributions 
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inferred based on the analysis of a collection of documents and can be viewed to classify, index, and summarize the content of documents. [From (31). 
Copyright 2012, Association for Computing Machinery, Inc. Reprinted with permission] 


(see Fig. 2). Deep network methods are being 
actively pursued in a variety of additional appli- 
cations from natural language translation to 
collaborative filtering. 

The internal layers of deep networks can be 
viewed as providing learned representations of 
the input data. While much of the practical suc- 
cess in deep learning has come from supervised 
learning methods for discovering such repre- 
sentations, efforts have also been made to devel- 
op deep learning algorithms that discover useful 
representations of the input without the need for 
labeled training data (13). The general problem is 
referred to as unsupervised learning, a second 
paradigm in machine-learning research (2). 

Broadly, unsupervised learning generally in- 
volves the analysis of unlabeled data under as- 
sumptions about structural properties of the 
data (e.g., algebraic, combinatorial, or probabi- 
listic). For example, one can assume that data 
lie on a low-dimensional manifold and aim to 
identify that manifold explicitly from data. Di- 
mension reduction methods—including prin- 
cipal components analysis, manifold learning, 
factor analysis, random projections, and autoen- 
coders (J, 2)—make different specific assump- 
tions regarding the underlying manifold (e.g., 
that it is a linear subspace, a smooth nonlinear 
manifold, or a collection of submanifolds). An- 
other example of dimension reduction is the 
topic modeling framework depicted in Fig. 3. 
A criterion function is defined that embodies 
these assumptions—often making use of general 
statistical principles such as maximum like- 
lihood, the method of moments, or Bayesian 
integration—and optimization or sampling algo- 
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rithms are developed to optimize the criterion. 
As another example, clustering is the problem 
of finding a partition of the observed data (and 
a rule for predicting future data) in the absence 
of explicit labels indicating a desired partition. 
A wide range of clustering procedures has been 
developed, all based on specific assumptions 
regarding the nature of a “cluster.” In both clus- 
tering and dimension reduction, the concern 
with computational complexity is paramount, 
given that the goal is to exploit the particularly 
large data sets that are available if one dis- 
penses with supervised labels. 

A third major machine-learning paradigm is 
reinforcement learning (/4, 15). Here, the infor- 
mation available in the training data is inter- 
mediate between supervised and unsupervised 
learning. Instead of training examples that in- 
dicate the correct output for a given input, the 
training data in reinforcement learning are as- 
sumed to provide only an indication as to whether 
an action is correct or not; if an action is incor- 
rect, there remains the problem of finding the 
correct action. More generally, in the setting of 
sequences of inputs, it is assumed that reward 
signals refer to the entire sequence; the assign- 
ment of credit or blame to individual actions in the 
sequence is not directly provided. Indeed, although 
simplified versions of reinforcement learning 
known as bandit problems are studied, where it 
is assumed that rewards are provided after each 
action, reinforcement learning problems typically 
involve a general control-theoretic setting in 
which the learning task is to learn a control strat- 
egy (a “policy”) for an agent acting in an unknown 
dynamical environment, where that learned strat- 


egy is trained to chose actions for any given state, 
with the objective of maximizing its expected re- 
ward over time. The ties to research in control 
theory and operations research have increased 
over the years, with formulations such as Markov 
decision processes and partially observed Mar- 
kov decision processes providing points of con- 
tact (15, 16). Reinforcement-learning algorithms 
generally make use of ideas that are familiar 
from the control-theory literature, such as policy 
iteration, value iteration, rollouts, and variance 
reduction, with innovations arising to address 
the specific needs of machine learning (e.g., large- 
scale problems, few assumptions about the un- 
known dynamical environment, and the use of 
supervised learning architectures to represent 
policies). It is also worth noting the strong ties 
between reinforcement learning and many dec- 
ades of work on learning in psychology and 
neuroscience, one notable example being the 
use of reinforcement learning algorithms to pre- 
dict the response of dopaminergic neurons in 
monkeys learning to associate a stimulus light 
with subsequent sugar reward (17). 

Although these three learning paradigms help 
to organize ideas, much current research involves 
blends across these categories. For example, semi- 
supervised learning makes use of unlabeled data 
to augment labeled data in a supervised learning 
context, and discriminative training blends ar- 
chitectures developed for unsupervised learning 
with optimization formulations that make use 
of labels. Model selection is the broad activity of 
using training data not only to fit a model but 
also to select from a family of models, and the 
fact that training data do not directly indicate 
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which model to use leads to the use of algo- 
rithms developed for bandit problems and to 
Bayesian optimization procedures. Active learn- 
ing arises when the learner is allowed to choose 
data points and query the trainer to request tar- 
geted information, such as the label of an other- 
wise unlabeled example. Causal modeling is the 
effort to go beyond simply discovering predictive 
relations among variables, to distinguish which 
variables causally influence others (e.g. a high 
white-blood-cell count can predict the existence 
of an infection, but it is the infection that causes 
the high white-cell count). Many issues influence 
the design of learning algorithms across all of 
these paradigms, including whether data are 
available in batches or arrive sequentially over 
time, how data have been sampled, require- 
ments that learned models be interpretable by 
users, and robustness issues that arise when 
data do not fit prior modeling assumptions. 


Emerging trends 


The field of machine learning is sufficiently young 
that it is still rapidly expanding, often by invent- 
ing new formalizations of machine-learning 
problems driven by practical applications. (An 
example is the development of recommendation 
systems, as described in Fig. 4.) One major trend 
driving this expansion is a growing concern with 
the environment in which a machine-learning 
algorithm operates. The word “environment” 
here refers in part to the computing architecture; 
whereas a classical machine-learning system in- 
volved a single program running on a single ma- 
chine, it is now common for machine-learning 
systems to be deployed in architectures that in- 
clude many thousands or ten of thousands of 
processors, such that communication constraints 
and issues of parallelism and distributed pro- 
cessing take center stage. Indeed, as depicted 
in Fig. 5, machine-learning systems are increas- 
ingly taking the form of complex collections of 
software that run on large-scale parallel and dis- 
tributed computing platforms and provide a range 
of algorithms and services to data analysts. 

The word “environment” also refers to the 
source of the data, which ranges from a set of 
people who may have privacy or ownership con- 
cerns, to the analyst or decision-maker who may 
have certain requirements on a machine-learning 
system (for example, that its output be visual- 
izable), and to the social, legal, or political frame- 
work surrounding the deployment of a system. 
The environment also may include other machine- 
learning systems or other agents, and the overall 
collection of systems may be cooperative or ad- 
versarial. Broadly speaking, environments pro- 
vide various resources to a learning algorithm 
and place constraints on those resources. Increas- 
ingly, machine-learning researchers are formalizing 
these relationships, aiming to design algorithms 
that are provably effective in various environ- 
ments and explicitly allow users to express and 
control trade-offs among resources. 

As an example of resource constraints, let us 
suppose that the data are provided by a set of 
individuals who wish to retain a degree of pri- 
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vacy. Privacy can be formalized via the notion of 
“differential privacy,” which defines a probabi- 
listic channel between the data and the outside 
world such that an observer of the output of the 
channel cannot infer reliably whether particular 
individuals have supplied data or not (78). Clas- 
sical applications of differential privacy have 
involved insuring that queries (e.g., “what is the 
maximum balance across a set of accounts?”) to 
a privatized database return an answer that is 
close to that returned on the nonprivate data. 
Recent research has brought differential privacy 
into contact with machine learning, where que- 
ries involve predictions or other inferential asser- 
tions (e.g., “given the data I've seen so far, what is 
the probability that a new transaction is fraud- 
ulent?”) (19, 20). Placing the overall design of a 
privacy-enhancing machine-learning system 
within a decision-theoretic framework provides 
users with a tuning knob whereby they can choose 
a desired level of privacy that takes into account 
the kinds of questions that will be asked of the 
data and their own personal utility for the an- 
swers. For example, a person may be willing to 


WARK 


Fig. 4. Recommendation systems. A recommen- 
dation system is a machine-learning system that is 
based on data that indicate links between a set 
of a users (e.g., people) and a set of items (e.g., 
products). A link between a user and a product 
means that the user has indicated an interest in 
the product in some fashion (perhaps by purchas- 
ing that item in the past). The machine-learning prob- 
lem is to suggest other items to a given user that he 
or she may also be interested in, based on the data 
across all users. 


reveal most of their genome in the context of 
research on a disease that runs in their family 
but may ask for more stringent protection if in- 
formation about their genome is being used to 
set insurance rates. 

Communication is another resource that needs 
to be managed within the overall context of a 
distributed learning system. For example, data 
may be distributed across distinct physical loca- 
tions because their size does not allow them to 
be aggregated at a single site or because of ad- 
ministrative boundaries. In such a setting, we may 
wish to impose a bit-rate communication con- 
straint on the machine-learning algorithm. Solving 
the design problem under such a constraint will 
generally show how the performance of the learn- 
ing system degrades under decrease in commu- 
nication bandwidth, but it can also reveal how 
the performance improves as the number of dis- 
tributed sites (e.g., machines or processors) in- 
creases, trading off these quantities against the 
amount of data (27, 22). Much as in classical in- 
formation theory, this line of research aims at 
fundamental lower bounds on achievable per- 
formance and specific algorithms that achieve 
those lower bounds. 

A major goal of this general line of research is 
to bring the kinds of statistical resources studied 
in machine learning (e.g., number of data points, 
dimension of a parameter, and complexity of a 
hypothesis class) into contact with the classical 
computational resources of time and space. Such 
a bridge is present in the “probably approximately 
correct” (PAC) learning framework, which studies 
the effect of adding a polynomial-time compu- 
tation constraint on this relationship among error 
rates, training data size, and other parameters of 
the learning algorithm (3). Recent advances in 
this line of research include various lower bounds 
that establish fundamental gaps in performance 
achievable in certain machine-learning prob- 
lems (e.g., Sparse regression and sparse princi- 
pal components analysis) via polynomial-time 
and exponential-time algorithms (23). The core 
of the problem, however, involves time-data trade- 
offs that are far from the polynomial/exponential 
boundary. The large data sets that are increas- 
ingly the norm require algorithms whose time 
and space requirements are linear or sublinear in 
the problem size Number of data points or num- 
ber of dimensions). Recent research focuses on 
methods such as subsampling, random projec- 
tions, and algorithm weakening to achieve scal- 
ability while retaining statistical control (24, 25). 
The ultimate goal is to be able to supply time 
and space budgets to machine-learning systems 
in addition to accuracy requirements, with the 
system finding an operating point that allows 
such requirements to be realized. 


Opportunities and challenges 


Despite its practical and commercial successes, 
machine learning remains a young field with 
many underexplored research opportunities. 
Some of these opportunities can be seen by con- 
trasting current machine-learning approaches 
to the types of learning we observe in naturally 
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Fig. 5. Data analytics stack. Scalable machine-learning systems are layered architectures that are 
built on parallel and distributed computing platforms. The architecture depicted here—an open- 
source data analysis stack developed in the Algorithms, Machines and People (AMP) Laboratory at 
the University of California, Berkeley—includes layers that interface to underlying operating systems; 
layers that provide distributed storage, data management, and processing; and layers that provide 
core machine-learning competencies such as streaming, subsampling, pipelines, graph processing, 


and model serving. 


occurring systems such as humans and other 
animals, organizations, economies, and biological 
evolution. For example, whereas most machine- 
learning algorithms are targeted to learn one 
specific function or data model from one single 
data source, humans clearly learn many differ- 
ent skills and types of knowledge, from years 
of diverse training experience, supervised and 
unsupervised, in a simple-to-more-difficult se- 
quence (e.g., learning to crawl, then walk, then 
run). This has led some researchers to begin 
exploring the question of how to construct com- 
puter lifelong or never-ending learners that op- 
erate nonstop for years, learning thousands of 
interrelated skills or functions within an over- 
all architecture that allows the system to im- 
prove its ability to learn one skill based on 
having learned another (26-28). Another aspect 
of the analogy to natural learning systems sug- 
gests the idea of team-based, mixed-initiative 
learning. For example, whereas current machine- 
learning systems typically operate in isolation 
to analyze the given data, people often work 
in teams to collect and analyze data (e.g., biol- 
ogists have worked as teams to collect and an- 
alyze genomic data, bringing together diverse 
experiments and perspectives to make progress 
on this difficult problem). New machine-learning 
methods capable of working collaboratively with 
humans to jointly analyze complex data sets 
might bring together the abilities of machines 
to tease out subtle statistical regularities from 
massive data sets with the abilities of humans to 
draw on diverse background knowledge to gen- 
erate plausible explanations and suggest new 
hypotheses. Many theoretical results in machine 
learning apply to all learning systems, whether 
they are computer algorithms, animals, organ- 
izations, or natural evolution. As the field pro- 
gresses, we may see machine-learning theory 
and algorithms increasingly providing models 
for understanding learning in neural systems, 
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organizations, and biological evolution and see 
machine learning benefit from ongoing studies 
of these other types of learning systems. 

As with any powerful technology, machine 
learning raises questions about which of its po- 
tential uses society should encourage and dis- 
courage. The push in recent years to collect new 
kinds of personal data, motivated by its eco- 
nomic value, leads to obvious privacy issues, as 
mentioned above. The increasing value of data 
also raises a second ethical issue: Who will have 
access to, and ownership of, online data, and who 
will reap its benefits? Currently, much data are 
collected by corporations for specific uses leading 
to improved profits, with little or no motive for 
data sharing. However, the potential benefits that 
society could realize, even from existing online 
data, would be considerable if those data were to 
be made available for public good. 

To illustrate, consider one simple example 
of how society could benefit from data that is 
already online today by using this data to de- 
crease the risk of global pandemic spread from 
infectious diseases. By combining location data 
from online sources (e.g., location data from cell 
phones, from credit-card transactions at retail 
outlets, and from security cameras in public places 
and private buildings) with online medical data 
(e.g., emergency room admissions), it would be 
feasible today to implement a simple system to 
telephone individuals immediately if a person 
they were in close contact with yesterday was just 
admitted to the emergency room with an infec- 
tious disease, alerting them to the symptoms they 
should watch for and precautions they should 
take. Here, there is clearly a tension and trade-off 
between personal privacy and public health, and 
society at large needs to make the decision on 
how to make this trade-off. The larger point of 
this example, however, is that, although the data 
are already online, we do not currently have the 
laws, customs, culture, or mechanisms to enable 


society to benefit from them, if it wishes to do so. 
In fact, much of these data are privately held and 
owned, even though they are data about each of 
us. Considerations such as these suggest that 
machine learning is likely to be one of the most 
transformative technologies of the 21st century. 
Although it is impossible to predict the future, it 
appears essential that society begin now to con- 
sider how to maximize its benefits. 
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REVIEW 


Advances in natural 
language processing 


Julia Hirschberg’ and Christopher D. Manning” 


Natural language processing employs computational techniques for the purpose of learning, 
understanding, and producing human language content. Early computational approaches to 
language research focused on automating the analysis of the linguistic structure of language 
and developing basic technologies such as machine translation, speech recognition, and speech 
synthesis. Today’s researchers refine and make use of such tools in real-world applications, 
creating spoken dialogue systems and speech-to-speech translation engines, mining social 
media for information about health or finance, and identifying sentiment and emotion toward 
products and services. We describe successes and challenges in this rapidly advancing area. 


ver the past 20 years, computational lin- 
guistics has grown into both an exciting 
area of scientific research and a practical 
technology that is increasingly being in- 
corporated into consumer products (for 
example, in applications such as Apple’s Siri and 
Skype Translator). Four key factors enabled these 
developments: (i) a vast increase in computing 
power, (ii) the availability of very large amounts 
of linguistic data, (iii) the development of highly 
successful machine learning (ML) methods, and 
(iv) a much richer understanding of the structure 
of human language and its deployment in social 
contexts. In this Review, we describe some cur- 
rent application areas of interest in language 
research. These efforts illustrate computational 
approaches to big data, based on current cutting- 
edge methodologies that combine statistical anal- 
ysis and ML with knowledge of language. 
Computational linguistics, also known as nat- 
ural language processing (NLP), is the subfield 
of computer science concerned with using com- 
putational techniques to learn, understand, and 
produce human language content. Computation- 
al linguistic systems can have multiple purposes: 
The goal can be aiding human-human commu- 
nication, such as in machine translation (MT); 
aiding human-machine communication, such as 
with conversational agents; or benefiting both 
humans and machines by analyzing and learn- 
ing from the enormous quantity of human lan- 
guage content that is now available online. 
During the first several decades of work in 
computational linguistics, scientists attempted 
to write down for computers the vocabularies 
and rules of human languages. This proved a 
difficult task, owing to the variability, ambiguity, 
and context-dependent interpretation of human 
languages. For instance, a star can be either an 
astronomical object or a person, and “star” can 
be a noun or a verb. In another example, two in- 
terpretations are possible for the headline “Teacher 
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strikes idle kids,” depending on the noun, verb, and 
adjective assignments of the words in the sentence, 
as well as grammatical structure. Beginning in the 
1980s, but more widely in the 1990s, NLP was 
transformed by researchers starting to build mod- 
els over large quantities of empirical language 
data. Statistical or corpus (“body of words”)-based 
NLP was one of the first notable successes of 
the use of big data, long before the power of 
ML was more generally recognized or the term 
“big data” even introduced. 

A central finding of this statistical approach to 
NLP has been that simple methods using words, 
part-of-speech (POS) sequences (such as whether 
a word is a noun, verb, or preposition), or simple 
templates can often achieve notable results when 
trained on large quantities of data. Many text 
and sentiment classifiers are still based solely on 
the different sets of words (“bag of words”) that 
documents contain, without regard to sentence 
and discourse structure or meaning. Achieving 
improvements over these simple baselines can be 
quite difficult. Nevertheless, the best-performing 
systems now use sophisticated ML approaches 
and a rich understanding of linguistic structure. 
High-performance tools that identify syntactic 
and semantic information as well as information 
about discourse context are now available. One 
example is Stanford CoreNLP (J), which provides 
a standard NLP preprocessing pipeline that in- 
cludes POS tagging (with tags such as noun, verb, 
and preposition); identification of named entities, 
such as people, places, and organizations; parsing 
of sentences into their grammatical structures; 
and identifying co-references between noun 
phrase mentions (Fig. 1). 

Historically, two developments enabled the 
initial transformation of NLP into a big data field. 
The first was the early availability to researchers 
of linguistic data in digital form, particularly 
through the Linguistic Data Consortium (LDC) 
(2), established in 1992. Today, large amounts 
of digital text can easily be downloaded from 
the Web. Available as linguistically annotated 
data are large speech and text corpora anno- 
tated with POS tags, syntactic parses, semantic 
labels, annotations of named entities (persons, 
places, organizations), dialogue acts (statement, 


question, request), emotions and positive or neg- 
ative sentiment, and discourse structure (topic 
or rhetorical structure). Second, performance im- 
provements in NLP were spurred on by shared 
task competitions. Originally, these competitions 
were largely funded and organized by the U.S. 
Department of Defense, but they were later or- 
ganized by the research community itself, such 
as the CoNLL Shared Tasks (3). These tasks were 
a precursor of modern ML predictive modeling 
and analytics competitions, such as on Kaggle (4), 
in which companies and researchers post their 
data and statisticians and data miners from all over 
the world compete to produce the best models. 

A major limitation of NLP today is the fact that 
most NLP resources and systems are available 
only for high-resource languages (HRLs), such as 
English, French, Spanish, German, and Chinese. 
In contrast, many low-resource languages (LRLs)— 
such as Bengali, Indonesian, Punjabi, Cebuano, 
and Swahili—spoken and written by millions of 
people have no such resources or systems avail- 
able. A future challenge for the language commu- 
nity is how to develop resources and tools for 
hundreds or thousands of languages, not just a few. 


Machine translation 


Proficiency in languages was traditionally a hall- 
mark of a learned person. Although the social 
standing of this human skill has declined in the 
modern age of science and machines, translation 
between human languages remains crucially im- 
portant, and MT is perhaps the most substantial 
way in which computers could aid human-human 
communication. Moreover, the ability of com- 
puters to translate between human languages 
remains a consummate test of machine intel- 
ligence: Correct translation requires not only 
the ability to analyze and generate sentences in 
human languages but also a humanlike under- 
standing of world knowledge and context, de- 
spite the ambiguities of languages. For example, 
the French word “bordel” straightforwardly means 
“brothel”; but if someone says “My room is un 
bordel,” then a translating machine has to know 
enough to suspect that this person is probably not 
running a brothel in his or her room but rather is 
saying “My room is a complete mess.” 

Machine translation was one of the first non- 
numeric applications of computers and was studied 
intensively starting in the late 1950s. However, the 
hand-built grammar-based systems of early dec- 
ades achieved very limited success. The field was 
transformed in the early 1990s when researchers 
at IBM acquired a large quantity of English and 
French sentences that were translations of each 
other (known as parallel text), produced as the 
proceedings of the bilingual Canadian Parliament. 
These data allowed them to collect statistics of 
word translations and word sequences and to 
build a probabilistic model of MT (5). 

Following a quiet period in the late 1990s, 
the new millennium brought the potent combina- 
tion of ample online text, including considerable 
quantities of parallel text, much more abundant 
and inexpensive computing, and a new idea 
for building statistical phrase-based MT systems 
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(6). Rather than translating word by word, the 
key advance is to notice that small word groups 
often have distinctive translations. The Japa- 
nese Ke “mizu iro” is literally the sequence 
of two words (“water color”), but this is not the 
correct meaning (nor does it mean a type of 
painting); rather, it indicates a light, sky-blue color. 
Such phrase-based MT was used by Franz Och in 
the development of Google Translate. 

This technology enabled the services we have 
today, which allow free and instant translation 
between many language pairs, but it still pro- 
duces translations that are only just serviceable 
for determining the gist of a passage. However, 
very promising work continues to push MT for- 
ward. Much subsequent research has aimed to 
better exploit the structure of human language 
sentences (i.e., their syntax) in translation sys- 
tems (7, 8), and researchers are actively building 
deeper meaning representations of language (9) 
to enable a new level of semantic MT. 

Finally, just in the past year, we have seen the 
development of an extremely promising approach 
to MT through the use of deep-learning-based 
sequence models. The central idea of deep learn- 
ing is that if we can train a model with several 
representational levels to optimize a final objec- 
tive, such as translation quality, then the model 
can itself learn intermediate representations 
that are useful for the task at hand. This idea 
has been explored particularly for neural net- 
work models in which information is stored in 
real-valued vectors, with the mapping between 
vectors consisting of a matrix multiplication fol- 
lowed by a nonlinearity, such as a sigmoid func- 
tion that maps the output values of the matrix 
multiplication onto [-1, 1]. Building large models 


Part of speech: 


——— 


NNP (VBD)_{IN) (NNP} 
es —— 


of this form is much more practical with the 
massive parallel computation that is now econo- 
mically available via graphics processing units. For 
translation, research has focused on a particular 
version of recurrent neural networks, with enhanced 
“long short-term memory” computational units 
that can better maintain contextual information 
from early until late in a sentence (JO, 1D) (Fig. 2). 
The distributed representations of neural networks 
are often very effective for capturing subtle seman- 
tic similarities, and neural MT systems have al- 
ready produced some state-of-the-art results (72, 13). 

A still-underexplored area in MT is getting ma- 
chines to have more of a sense of discourse, so 
that a sequence of sentences translates naturally— 
although work in the area has begun (/4). Finally, 
MT is not necessarily a task for machines to do 
alone. Rather it can be reconceptualized as an op- 
portunity for computer-supported cooperative 
work that also exploits human skills (75). In such 
a system, machine intelligence is aimed at human- 
computer interface capabilities of giving effective 
suggestions and reacting productively to human 
input, rather than wholly replacing the skills and 
knowledge of a human translator. 


Spoken dialogue systems and 
conversational agents 


Dialogue has been a popular topic in NLP re- 
search since the 1980s. However, early work on 
text-based dialogue has now expanded to include 
spoken dialogue on mobile devices (e.g., Apple’s 
Siri, Amtrak’s Julie, Google Now, and Microsoft’s 
Cortana) for information access and task-based 
apps. Spoken dialogue systems (SDSs) also allow 
robots to help people with simple manual tasks 
[e.g., Manuela Veloso’s CoBots (J6)] or provide 


therapy for less-abled persons [e.g., Maja Mataric’s 
socially assistive robots (17)]. They also enable ava- 
tars to tutor people in interview or negotiation stra- 
tegies or to help with health care decisions (18, 19). 
The creation of SDSs, whether between hu- 
mans or between humans and artificial agents, 
requires tools for automatic speech recognition 
(ASR), to identify what a human says; dialogue 
management (DM), to determine what that hu- 
man wants; actions to obtain the information or 
perform the activity requested; and text-to-speech 
(TTS) synthesis, to convey that information back 
to the human in spoken form. (Fig. 3). In addition, 
SDSs need to be ready to interact with users when 
an error in speech recognition occurs; to decide 
what words might be incorrectly recognized; and 
to determine what the user actually said, either 
automatically or via dialogue with the user. In 
speech-to-speech translation systems, MT com- 
ponents are also needed to facilitate dialogue 
between speakers of different languages and the 
system, to identify potential mistranslations before 
they occur, and to clarify these with the speaker. 
Practical SDSs have been enabled by break- 
throughs in speech recognition accuracy, mainly 
coming from replacing traditional acoustic feature- 
modeling pipelines with deep-learning models that 
map sound signals to sequences of human language 
sounds and words (20). Although SDSs now work 
fairly well in limited domains, where the topics of 
the interaction are known in advance and where 
the words people are likely to use can be predeter- 
mined, they are not yet very successful in open- 
domain interaction, where users may talk about 
anything at all. Chatbots following in the tradition 
of ELIZA (27) handle open-domain interaction by 
cleverly repeating variations of the human input; 
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Fig. 1. Many language technology tools start by doing linguistic structure analysis. Here we show output from Stanford CoreNLP. As shown from top to 
bottom, this tool determines the parts of speech of each word, tags various words or phrases as semantic named entities of various sorts, determines which entity 
mentions co-refer to the same person or organization, and then works out the syntactic structure of each sentence, using a dependency grammar analysis. 
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weekend <EOS> 


protests escalated over the 


Die proteste waren 


am wochenende eskaliert <EOS> 


The protests escalated over the weekend 


Fig. 2. A deep, recurrent neural MT system (10). Initially trained on parallel sentences that translate each 
other, the model learns a representation of each word as a real-valued vector and internal parameter matrices so 
as to optimize translation quality. The trained network can then translate new sentences. Each arrow represents 
a computation unit of a matrix multiplication followed by a nonlinear transformation; the small vectors shown in 
the illustration might really be 1000-dimensional. The recurrent network first encodes the meaning of the source 
sentence (left side, blue). It maintains an internal state representing a partial sentence, which is updated after 
each new word is read (horizontal arrows). Having the upper network layers [mapped to by additional (upper 
vertical) computation arrows] makes this a deep recurrent network. Adding depth improves the ability of the 
model to learn, generalize, and remember. Once the end of the sentence (denoted by <EOS>) is reached 
(middle, dark blue), the network additionally starts to produce a word of translated output at each step from its 
internal state (using a multiclass logistic regression—-style model). During translation generation (right side, 
green), the last generated word is fed in as the input at each step. From the stored hidden state and this input, 
the model calculates the next word of the translation. This process repeats until <EOS> is generated. 


this approach is also being attempted in spoken- 
chat systems designed to provide a sense of com- 
panionship for target audiences such as the elderly 
or individuals with dementia (22). In spoken dia- 
logue, information about the speaker’s mental 
state inferred from multimodal information can 
be used to supplement the system’s knowledge of 
what the user is saying. 

There are many challenges in building SDSs, 
in addition to the primary challenge of improv- 
ing the accuracy of the basic ASR, DM, and TTS 
building blocks and extending their use into less- 
restricted domains. These include basic problems 
of recognizing and producing normal human 
conversational behaviors, such as turn-taking 
and coordination. Humans interpret subtle cues 
in speakers’ voices and facial and body gestures 
(where available) to determine when the speaker 
is ready to give up the turn versus simply pausing. 
These cues, such as a filled pause (e.g., “um” or 
“ah”), are also used to establish when some 
feedback from the listener is desirable, to indi- 
cate that he or she is listening or working on a 
request, as well as to provide “grounding” (i.e., 
information about the current state of the con- 
versation). Non-humanlike latency often makes 
SDS burdensome, as users must wait seconds to 
receive a system response. To address this, re- 
searchers are exploring incremental processing 
of ASR, MT, and TTS modules, so that systems 
can respond more quickly to users by beginning 
these recognition, translation, and generation 
processes while the user is still speaking. Hu- 
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mans can also disambiguate words such as “yeah” 
and “okay,” which may have diverse meanings— 
including agreement, topic shift, and even 
disagreement—when spoken in different ways. 
In successful and cooperative conversations, hu- 
mans also tend to entrain to their conversational 
partners, becoming more similar to each other in 
pronunciation, word choice, acoustic and pro- 
sodic features, facial expressions, and gestures. 
This tendency has long been used to subtly in- 
duce SDS users to employ terms that the system 
can more easily recognize. Currently, research- 
ers are beginning to believe that systems (parti- 
cularly embodied agents) should entrain to their 
users in these different modalities, and some ex- 
perimental results have shown that users prefer 
such systems (23) and even think they are more 
intelligent (24). Open issues for DM have long been 
the determination of how to architect the appro- 
priate dialogue flow for particular applications, 
where existing experimental data may be sparse 
and some aspects of the dialogue state may not yet 
have been observed or even be observable from the 
data. Currently, the most widely used approach is 
the POMDP (partially observable Markov decision 
process), which attempts to identify an optimal 
system policy by maintaining a probability distri- 
bution over possible SDS states and updating this 
distribution as the system observes additional 
dialogue behavior (25). This approach may make 
use of the identification of dialogue acts, such as 
whether the user input represents a question, 
statement, or indication of agreement, for example. 


Machine reading 

The printed word has great power to enlighten. 
Machine reading is the idea that machines could 
become intelligent, and could usefully integrate 
and summarize information for humans, by read- 
ing and understanding the vast quantities of text 
that are available. 

In the early decades of artificial intelligence, many 
researchers focused on the approach of trying to 
enable intelligent machines by manually building 
large structured knowledge bases in a formal logi- 
cal language and developing automated reasoning 
methods for deriving further facts from this knowl- 
edge. However, with the emergence of the mod- 
ern online world, what we mainly have instead is 
huge repositories of online information coded in 
human languages. One place where this is true is 
in the scientific literature, where findings are still 
reported almost entirely in human language text 
(with accompanying tables and diagrams). How- 
ever, it is equally true for more general knowledge, 
where we now have huge repositories of infor- 
mation such as Wikipedia (26). The quantity of 
scientific literature is growing rapidly: For example, 
the size of the U.S. National Library of Medicine’s 
Medline index has grown exponentially (27). At 
such a scale, scientists are unable to keep up with 
the literature, even in their narrow domains of 
expertise. Thus, there is an increased need for 
machine reading for the purposes of comprehend- 
ing and summarizing the literature, as well as ex- 
tracting facts and hypotheses from this material. 

An initial goal is to extract basic facts, most 
commonly a relation between two entities, such 
as “child of’ (for instance, Bill Clinton, Chelsea 
Clinton). This is referred to as relation extrac- 
tion. For particular domain-specific relations, 
many such systems have been successfully built. 
One technique is to use handwritten patterns 
that match the linguistic expression of relations 
(e.g., <PERSON>’s daughter, <PERSON>). Bet- 
ter results can be obtained through the use of 


Speech in 


Fig. 3. A spoken dialogue system. The three main 
components are represented by rectangles; arrows 
denote the flow of information. 
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ML. A structured prediction classifier proposes 
instances of such relations based on extracted 
features from the sequence of words and gram- 
matical structure of a sentence (28, 29). Such sys- 
tems are the mainstay of literature fact-extraction 
tools in fields such as biomedicine (30, 37). 

In many scientific fields, there have been ma- 
jor efforts to build databases of structured infor- 
mation based on the textual scientific record, 
such as the Gene Ontology database (32) in bio- 
medicine or the PaleoBiology Database for fossil 
records (33). This has generally been done man- 
ually, via concerted work by trained profes- 
sionals. Using artificial intelligence software to 
extract these databases, as well as to perform 
subsequent reasoning and hypothesis genera- 
tion, has become a major research goal. One 
subfield where these questions have been ac- 
tively pursued is pharmacogenomics (34). For 
example, Percha et al. (35) trained a model of 
drug-drug interactions based on drug-gene in- 
teractions extracted from the literature and were 
able to use it to predict novel drug-drug interactions. 

Ifa partial knowledge base—for instance, Freebase 
(36), dbpedia (37), Wikidata (38) (related to Wikipe- 
dia), or the Gene Ontology database (32)—has already 
been extracted from biomedical research articles, 
then there is an opportunity to automatically align 
known facts from the knowledge base with puta- 
tive expressions of those facts in text. The type labels 
from this mapping can then be used as if they were 
supervised data for ML information-extraction sys- 
tems (Fig. 4). This is referred to as distantly super- 
vised relation extraction. Early systems aligned 
entity mentions and then made the naive assump- 
tion that sentences containing a pair of entities 
expressed every known relation between the two 
entities in the database (39). More recent systems 
have used increasingly sophisticated probabilistic 
inference to discern which textual clauses map to 
which facts in the knowledge base, or to something 
else entirely (40, 47). A dramatic recent applica- 
tion of this approach has been the DeepDive sys- 
tem (42), which aims to automate the construction 
of such systems by providing efficient large-scale 
learning and inference so a user can simply focus 
on good features for their domain. PaleoDeepDive, 
its application to the fossil record, has recently been 
shown to do a better job at fact extraction from 
journal articles than the scientist volunteers who 
maintain the PaleoBiology Database (43). 

The relation-extraction task is made general, 
if less semantically precise, by aiming to ex- 
tract all relations from any piece of text, a task 
normally referred to as open information ex- 
traction (Open IE) (44). Early work emphasized 
the development of simple but highly scalable 
fact-extraction techniques that do not require 
any kind of hand-labeled data (45). With ever- 
growing computational power, a second genera- 
tion of work increasingly emphasized careful use 
of linguistic structure, which can reliably be ex- 
tracted with the use of detailed NLP (46). 

Currently, a number of avenues are being 
explored to further extend the ability of compu- 
ters to build and use knowledge bases starting 
from textual information. An exciting unification 
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is the proposal for universal schemas (47), which 
allow simultaneous inference and knowledge- 
base completion over both the open set of textual 
relations (such as “born in”) found in Open IE 
and the more exact schema of databases (such 
as per:city_of_birth). Even with all of our text- 
extraction techniques, any knowledge base will 
only be partial and incomplete; some recent 
work explores how it can be probabilistically com- 
pleted to deliver a form of common-sense rea- 
soning (48). Finally, we hope to move beyond 
simply extracting relations, events, and facts to 
be able to understand the relations between 
events (such as causation) and complex multistep 
procedures and processes. In (49), Berant et al. 
explore how this can be done for understanding 
the steps in biological processes, showing that 
extracting explicit process structures can improve 
the accuracy of question answering. The flip side of 
machine reading is to provide question-answering 
systems, by which humans can get answers from 
constructed knowledge bases. There has recently 
been dramatic progress in building such systems 
by learning semantic parsers (50). 


Mining social media 


The development of social media has revolution- 
ized the amount and types of information avail- 
able today to NLP researchers. Data available 
from sources such as Twitter, Facebook, YouTube, 
blogs, and discussion forums make it possible 
to examine relations between demographic in- 
formation, language use, and social interaction 
(51). Researchers use Web-scraping techniques, 
often via application program interfaces pro- 
vided by websites, to download previously unim- 
aginable amounts and categories of data. Using 
statistical and ML techniques, they learn to iden- 
tify demographic information (such as age and 
gender) from language, track trending topics and 
popular sentiment, identify opinions and beliefs 
about products and politicians, predict disease 
spreading (for instance, with Google Flu Trends: 
www.google.org/flutrends/) from symptoms men- 
tioned in tweets or food-related illnesses (52), 
recognize deception in fake reviews (53), and 
identify social networks of people who interact 
together online. 

In this era of big data, the availability of social 
media has revolutionized the ways advertisers, 
journalists, businesses, politicians, and medical 
experts acquire their data and the ways in which 
those data can be put to practical use. Product 
reviews can be mined to predict pricing trends 
and assess advertising campaigns. Political forums 
can be searched to predict candidate appeal and 
performance in elections. Social networks can be 
examined to find indicators of power and influ- 
ence among different groups. Medical forums can 
be studied to discover common questions and 
misconceptions about sufferers from particular 
medical conditions so that website information 
can be improved. 

Social media also provide very large and rich 
sources of conversational data in Web forums 
that can provide “found” data for the study of lan- 
guage phenomena such as code-switching (mixed 
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per: city_of_birth 


Fig. 4. Distantly supervised learning. |n this ap- 
proach, facts in a structured knowledge repre- 
sentation are projected onto pieces of text that 
mention the people, places, dates, etc., that appear 
in knowledge-base entries. This projection is noisy, 
but when done over large quantities of text, it pro- 
vides enough signal to successfully learn good clas- 
sifiers for extracting relations from text. [Photo 
source: National Library of Australia, http://nla.gov. 
au/nla.pic-anl12267621] 
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language in bilingual speech), hedging behavior 
(words and phrases indicating lack of commitment 
to a proposition such as “sort of”), and hate speech 
or bullying behavior. Social media exist in a wide 
variety of languages, including both HRLs and 
LRLs. These data can be invaluable for enriching 
ASR language models and developing TTS synthe- 
sizers without the need to create costly special- 
purpose corpora. In turn, these technologies can 
be useful in producing SDSs in LRL areas. Such 
systems can provide millions of people with the 
ability to obtain information over their cell phones 
(which are ubiquitous, even among populations 
with low literacy rates or whose languages or 
dialects have no standard written form), similar to 
the residents of HRL countries. The development 
of tools for LRLs from found LRL data, by adapt- 
ing HRL tools, is another important way to use 
found text data. A particular application of data 
mining in LRLs is the mining of data collected 
from Twitter or blogs to provide valuable infor- 
mation for disaster relief organizations, identify- 
ing the most serious problems, where they occur, 
and who is experiencing them. 

There are also some drawbacks to social media 
data mining. There is an increasing concern for 
privacy issues, particularly for an individual’s con- 
trol over their own data versus researchers’ desire 
to mine it. Sites such as Twitter severely limit a 
researcher’s ability to download data, which im- 
pedes speedy corpus collection. There is also a 
major issue with discovering “ground truth” in 
online postings, because there is no clear way of 
validating an individual's demographic information; 
the validity of posts concerning events; and most 
reviews of hotels, restaurants, and products. Aggre- 
gating information from multiple sources at similar 
times can address some validity issues, and sites do 
attempt to identify spurious reviews, but this issue 
remains perhaps the most difficult one for those 
working with social media data. 


Analysis and generation of speaker state 


Speaker states (54), also termed “private states” 
(55), include opinions, speculations, beliefs, emo- 
tions, and any other evaluative views that are 
personally held by the speaker or writer of a 
language. Much of the work in NLP has focused 
on sentiment analysis (identification of positive 
or negative orientation of textual language) and 
identification of belief states (committed belief, 


Breakfast on Broadway is a new place focusi 
it, breakfast/brunch. Went there last Sunday around 1. The food 
was not bad but the service was pretty terrible. We had to wait 15 
minutes just to get menus and another 30 to get something to 
eat. And there were only a few tables occupied! If you don't mind 
the wait though, the price is right. I'll probably give it another try. 
Maybe they need time to get their act together. 


uncommitted belief, or neutrality of a sentence) 
on the basis of lexical and syntactic information. 
Both sentiment and belief constitute attitudes to- 
ward events and propositions, although sentiment 
can also concern attitudes toward objects such as 
people, organizations, and abstract concepts. De- 
tection of sentiment and emotion in text requires 
lexical and sentence-level information. Sentiment 
can be signaled by words conveying positive or 
negative orientation: For example, “sad,” “worried,” 
“difficult,” and “weak” are all words with negative 
orientation, whereas “comfortable,” “important,” 
“successful,” and “interesting” convey a positive 
sentiment. Online sentiment dictionaries, such as 
Whissel’s Dictionary of Affect (56), and systems 
created from subject-ranked terms, such as Tausczik 
and Pennebaker’s LIWC (Linguistic Inquiry and 
Word Count) (57), can be used to assess positive 
and negative sentiment in a text. More sophis- 
ticated approaches to sentiment analysis also seek 
to identify the holder (source) as well as the object 
of the sentiment: for instance, who is positive about 
what person, country, activity, or concept (55). 

The speech community has also studied pos- 
itive and negative attitudes by focusing more 
generally on the identification of positive and neg- 
ative emotions, primarily using acoustic and pro- 
sodic information. However, more work is currently 
being done to identify particular emotions, such 
as Ekman’s classic six basic emotions (anger, dis- 
gust, fear, happiness, sadness, surprise), which 
may be reactions to events, propositions, or objects. 
There has also been considerable research using 
features that have proven important in recog- 
nizing classic emotions to identify other speak- 
er states (such as deception), medical conditions 
(such as autism and Parkinson’s disease), speaker 
characteristics (such as age, gender, likeability, 
pathology, and personality), and speaker conditions 
(such as cognitive load, drunkenness, sleepiness, 
interest, and trust). Corpora collected for such 
studies have been used in the Interspeech Para- 
linguistic Challenges, which have been conducted 
since 2009. Emotion generation has proven a 
more difficult challenge for TTS synthesis. Although 
there are some systems (e.g., MARY) that at- 
tempt to generate emotions such as depression, 
aggression, or cheerfulness (58), the best synthe- 
sized emotion still comes from corpora recorded for 
particular emotions by voice talent imitating those 
emotions. 


ng on, you guessed 


together.] 


Sentiment classification is widely used in opin- 
ion identification (positive or negative views of 
people, institutions, or ideas) in many languages 
and genres. Particular applications abound, such 
as identifying positive and negative movie or pro- 
duct reviews (59, 60) and predicting votes from con- 
gressional records (67) or Supreme Court decisions 
from court proceedings. Figure 5 illustrates a typical 
restaurant review, annotated for positive, negative, 
and neutral sentiment, as well as basic emotions. 

Mining social media for sentiment or classic emo- 
tions has been a particularly popular topic for the 
purposes of assessing the “public mood” from Twit- 
ter, predicting stock market trends, or simply eval- 
uating a community’s mental state (62). Social media 
such as Twitter, blog posts, and forums also provide 
researchers with very large amounts of data to use in 
assessing the role of sentiment and emotion in 
identifying other linguistic or social phenomena 
[e.g., sarcasm (63), power relationships, and so- 
cial influence (64)], as well as mental health is- 
sues [e.g., depression (65)]. 


Conclusion and outlook 


Many times during the past 50 years, enthusi- 
astic researchers have had high hopes that the 
language-understanding ability of robots in sci- 
ence fiction movies was just around the corner. 
However, in reality, speech and language under- 
standing did not work well enough at that time 
to power mainstream applications. The situation 
has been changing dramatically over the past five 
years. Huge improvements in speech recognition 
have made talking to your phone a commonplace 
activity, especially for young people. Web search 
engines are increasingly successful in understanding 
complex queries, and MT can at least yield the gist 
of material in another language, even if it cannot 
yet produce human-quality translations. Com- 
puter systems trade stocks and futures auto- 
matically, based on the sentiment of reports about 
companies. As a result, there is now great com- 
mercial interest in the deployment of human 
language technology, especially because natural 
language represents such a natural interface 
when interacting with mobile phones. In the 
short term, we feel confident that more data 
and computation, in addition to recent advances 
in ML and deep learning, will lead to further 
substantial progress in NLP. However, the truly 
difficult problems of semantics, context, and 


Breakfast on Broadway is a new place focusing on, you guessed it, 
breakfast/brunch. Went there last Sunday around 1. The food was 
not bad but [Anger: the service was pretty terrible]. [Disgust: We 
had to wait 15 minutes just to get menus and another 30 to get 
something to eat. And there were only a few tables occupied!] If 
you don’t mind the wait though, the price is right. I'll probably give 
it another try. [Uncertainty: Maybe they need time to get their act 


Fig. 5. Manually annotated text analysis on a sample restaurant review. Sentiment analysis is shown on the left (blue, positive sentiments; red, negative; gray, 
neutral). In the emotion analysis on the right, emotions are shown in bold type and delineated by square brackets. Note in particular the importance of going beyond 
simple keyword analysis; for example, “not” has scope over “bad,” which might mislead simple systems. Also, the presence of “hedge” words and phrases, which muddle 
the intended meaning (e.g., “pretty,” which has a positive connotation, modifying the negative word “terrible”), somewhat decreases the negative score of the next clause. 
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knowledge will probably require new discov- 
eries in linguistics and inference. From this per- 
spective, it is worth noting that the development 
of probabilistic approaches to language is not 
simply about solving engineering problems: Prob- 
abilistic models of language have also been reflected 
back into linguistic science, where researchers are 
finding important new applications in describing 
phonology (66), understanding human language 
processing (67), and modeling linguistic seman- 
tics and pragmatics (68). Many areas of linguistics 
are themselves becoming more empirical and 
more quantitative in their approaches. 
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REVIEW 


Economic reasoning and 
artificial intelligence 


David C. Parkes’* and Michael P. Wellman?* 


The field of artificial intelligence (Al) strives to build rational agents capable of perceiving the 
world around them and taking actions to advance specified goals. Put another way, Al 
researchers aim to construct a synthetic homo economicus, the mythical perfectly rational 
agent of neoclassical economics. We review progress toward creating this new species of 
machine, machina economicus, and discuss some challenges in designing Als that can reason 
effectively in economic contexts. Supposing that Al succeeds in this quest, or at least comes 
close enough that it is useful to think about Als in rationalistic terms, we ask how to design the 
rules of interaction in multi-agent systems that come to represent an economy of Als. Theories of 
normative design from economics may prove more relevant for artificial agents than human 
agents, with Als that better respect idealized assumptions of rationality than people, interacting 


through novel rules and incentive systems quite distinct from those tailored for people. 


conomics models the behavior of people, 

firms, and other decision-makers as a means 

to understand how these decisions shape 

the pattern of activities that produce value 

and ultimately satisfy (or fail to satisfy) hu- 
man needs and desires. In this enterprise, the 
field classically starts from an assumption that 
actors behave rationally—that is, their decisions 
are the best possible given their available actions, 
their preferences, and their beliefs about the out- 
comes of these actions. Economics is drawn to 
rational decision models because they directly con- 
nect choices and values in a mathematically precise 
manner. Critics argue that the field studies a mythi- 
cal species, homo economicus (“economic man”) 
and produces theories with limited applicability 
to how real humans behave. Defenders acknowl- 
edge that rationality is an idealization but counter 
that the abstraction supports powerful analysis, 
which is often quite predictive of people’s behav- 
ior (as individuals or in aggregate). Even if not 
perfectly accurate representations, rational models 
also allow preferences to be estimated from ob- 
served actions and build understanding that can 
usefully inform policy. 

Artificial intelligence (AI) research is likewise 
drawn to rationality concepts, because they pro- 
vide an ideal for the computational artifacts it 
seeks to create. Core to the modern conception of 
Al is the idea of designing agents: entities that 
perceive the world and act in it (2). The quality of 
an AI design is judged by how well the agent’s 
actions advance specified goals, conditioned on 
the perceptions observed. This coherence among 
perceptions, actions, and goals is the essence of 
rationality. If we represent goals in terms of pref- 
erence over outcomes, and conceive perception 
and action within the framework of decision- 
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making under uncertainty, then the AI agent’s 
situation aligns squarely with the standard eco- 
nomic paradigm of rational choice. Thus, the AI 
designer’s task is to build rational agents, or agents 
that best approximate rationality given the limits 
of their computational resources (2-4). In other 
words, AI strives to construct—out of silicon (or 
whatever) and information—a synthetic homo eco- 
nomicus, perhaps more accurately termed mach- 
ina economicus. 

The shared rationality abstraction provides a 
strong foundation for research that spans AI and 
economics. We start this review by describing pro- 
gress on the question of how to operationalize 
rationality and how to construct AI agents that 
are able to reason about other Als. Supposing 
that AI research succeeds in developing an agent 
that can be usefully modeled as rational (perhaps 


more so than human agents), we turn to research 
on the design of systems populated by multiple 
Als. These multi-agent systems will function as 
AI economies, with Als engaged in transactions 
with other Als as well as with firms and people. 
This prospect has spawned interest in expanding 
theories of normative design from economics, op- 
timizing rules of encounter (5) to guide multi-agent 
interactions. Systems populated by Als may exhibit 
new economic phenomena and thus require a new 
science with which to understand the way they 
function and to guide their design. For example, 
although human cognitive constraints limit the 
design of current markets, systems designed for 
Als may admit more complex interfaces, impose 
greater calculation burdens, and demand more 
stamina of attention. 

At the same time, the ways in which the behav- 
ior of Als deviate from the behavior of people can 
present new challenges. We can already glimpse 
the future of economic Als, with simple AI bots 
pricing books for sale on Amazon and scanning 
for restaurant tables on Opentable for resale at a 
profit (6). Such AIs may introduce some efficien- 
cies, but their lack of common sense and their 
designer’s failure to anticipate interactions can 
also lead to books priced at $23 million (7). More 
sophisticated AI strategies, presumably more 
carefully vetted, exert a large influence on finan- 
cial markets, with automated trading algorithms 
estimated to be responsible for more than 70% of 
trades on U.S. stock markets (8). Given the conse- 
quences, it is important to understand the effect 
of ubiquitous automated agents on the perform- 
ance of economic systems. As reasoning is shifted 
from people to Als—designed to learn our pref- 
erences, overcome our decision biases, and make 
complex cost-benefit trade-offs—how too should 
the economic institutions that mediate everyday 


transactions change? 
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Fig. 1. A bounded reinforcement learning agent performs better by pursuing a designed reward 
function different from the objective reward: its actual fitness evaluation. Results (left) from a 
gridworld foraging domain (right), for various limits on the agent's planning horizon (84). Unless the 
agent is perfectly rational (i.e., no horizon limit)—not typically feasible in realistic applications—the 
designer can often achieve better fitness by directing the agent to optimize an alternative measure. 
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Fig. 2. Researchers produced steady exponential progress on solving games of imperfect information from 1995 to the present. Up to 2007 (left), game 
size was generally reported in terms of nodes in the game tree. Based on methods introduced around that time, it became more meaningful (right) to report size in 
terms of the number of information sets (each many nodes), which represent distinct situations as perceived from the perspective of a player. The circled data 
points correspond to the same milestone; combining the two graphs thus demonstrates the continual exponential improvement. Data are from (23, 35, 85-90). 


We focus here on some of the research direc- 
tions we consider most salient for a future syn- 
thesis of economics and AI engendered by the 
emergence of machina economicus. Interesting 
as they are, we only briefly mention here the 
many exciting applications of AI to problems in 
economics such as matching (9), market clearing 
(10), and preference modeling for smart grids 
(11). Nor will we showcase the many ways in 
which economic theory is finding application 
today within AIl—for example, game-theoretic 
approaches to multi-agent learning (12) and 
voting procedures to combine the opinions of 
Als (13). 


Building machina economicus 


Constructing a rational AI raises a host of tech- 
nical challenges not previously addressed in the 
long tradition of rationalistic modeling in the 
social sciences. For economics, the agent atti- 
tudes (e.g., beliefs and preferences) underlying 
rationality are conceptual abstractions. Econo- 
mists need not explain how capabilities and 
preferences, for example, are encoded, nor the 
algorithm by which an agent plans what actions 
to take conditional on its perceptions. Compu- 
tation is abstracted away in the standard eco- 
nomic model and is precisely what the AI scientist 
must account for to operationalize rationality in a 
realized agent. 

This does not mean that an AI design needs to 
incorporate data structures corresponding directly 
to rationality constructs, although many AI archi- 
tectures do feature direct representations for prop- 
ositions, goals, and the like. Such representations 
may simplify the analysis of AI systems—for exam- 
ple, we can ask whether an inference algorithm 
operating on logical expressions possesses desir- 
able properties such as soundness: that all con- 
clusions follow from the premises. Similarly, if an 
Al’s beliefs are encoded as probability distribu- 
tions, we can ask whether it updates its beliefs 
from observations in proper accord with Bayesian 
theory. However, care must be taken in under- 
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standing an agent’s attitudes solely in terms of 
its internal data structures. Imperfections in 
decision-making may mean that the beliefs held 
and objectives pursued by a computational agent, 
in effect, vary systematically from those directly 
encoded. 

As an example illustrating this distinction, 
machine-learning researchers adapted from ani- 
mal learning the concept of reward shaping (74). 
In reinforcement learning, the agent derives a 
policy (mapping from perception sequences to 
actions) based on rewards representing instan- 
taneous value associated with a state and action. 
A designer specifying the input reward can often 
train the agent more efficiently by shaping the 
reward signal over the learning process to facil- 
itate convergence to behavior optimizing the 
designer’s objective. The framework of optimal 
rewards (15) provides a general treatment distin- 
guishing reward specifications and designer goals. 
As shown in Fig. 1, the optimal reward input to 
the agent does not generally correspond to the 
designer’s ideal reward. This perspective helps 
explain the role of intrinsic motivations (e.g., 
curiosity) in a flexible learning agent. 

Although the mantle of designing machina 
economicus may not be adopted (particularly 
in such explicit terms) by all AI researchers, 
many AI advances over the past few decades 
can be characterized as progress in operation- 
alizing rationality. For instance, probabilistic 
reasoning was largely eschewed by AI 30 years 
ago but now pervades the field, thanks to 
developments in representation and inference 
using Bayesian networks and related graphi- 
cal formalisms. Expressing uncertainty about 
general relationships, beyond mere proposi- 
tions, is routinely supported in probabilistic 
modeling languages (16). Statistical approaches 
now dominate machine learning and natural 
language processing (17, 18). Likewise, prefer- 
ence handling (including methods for eliciting 
preferences from the designer of an AI agent, 


compactly representing preferences over com- 


plex domains, and enabling inference about pre- 
ferences) is regarded as a necessary AI facility. 
Planning, the AI subfield concerned with action 
over time, now conventionally frames its prob- 
lem as one of optimization, subject to resource 
constraints, multiple objectives, and probabilistic 
effects of actions. 

Will AI succeed in developing the ideal ration- 
al agent? As much as we strive to create machina 
economicus, absolutely perfect rationality is un- 
achievable with finite computational resources. 
Amore salient question is whether AI agents will 
be sufficiently close to the ideal as to merit think- 
ing about them and interacting with them in 
rationalistic terms. Such is already the case, at 
least in a limited sense. Whenever we anthropo- 
morphize our machines, we are essentially treat- 
ing them as rational beings, responding to them 
in terms of our models of their knowledge, goals, 
and intentions. A more refined version of the 
question is whether our formal rationality the- 
ories will fit well the behavior of AI agents in 
absolute terms or compared to how well the 
theories work for people. Without offering any 
judgment on the question of how well rationality 
theories capture essential human behavior, we 
note the irony in the prospect that social science 
theories may turn out to apply with greater fidel- 
ity to nonhuman agent behavior. 


Reasoning about other agents 


The issue of agent theorizing is not merely aca- 
demic. If we can build one AI agent, then we can 
build many, and these Als will need to reason 
about each other as well as about people. For Als 
designed to approximate machina economicus, it 
stands to reason that they should treat each other 
as rational, at least as a baseline assumption. These 
Als would adopt a game-theoretic view of the 
world, where agents rationally respond to each 
others’ behavior, presumed (recursively) to be ra- 
tional as well. A consequence is that agents would 
expect their joint decisions to be in some form of 
equilibrium, as in standard economic thinking. 
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That Als (or Al-human combinations) are rea- 
sonably modeled as approximately rational is the 
premise of a growing body of AI research apply- 
ing economic equilibrium models to scenarios 
involving multiple agents (19). The approach has 
achieved notable successes, providing evidence 
for the premise, at least in particular circum- 
stances. Just as single-agent rationality does not 
require literal expected-utility calculations, appli- 
cability of an equilibrium model does not require 
that agents themselves be explicitly engaged in 
equilibrium reasoning. For example, the litera- 
ture on learning in games (20) has identified 
numerous conditions in which simple adaptive 
strategies converge to strategic equilibria. We 
can evaluate the effectiveness of economic mod- 
eling by examining agents built by AI designers 
for specified tasks. For instance, in a study of AI 
trading agents competing in a shopping game 
(21, an agent using standard price equilibrium 
models from economics (specifically, Walrasian 
equilibrium) achieved comparable prediction ac- 
curacy to sophisticated machine-learning ap- 
proaches without using any data, even though 
none of the other agents employed equilibrium 
reasoning. 


A Prisoner's dilemma 


Cooperate Defect 


Cooperate 


Defect 


poker game (23). HULHE is the largest game of 
imperfect information ever solved (with more 
than 10" information sets after removing sym- 
metries) and the first imperfect-information 
game widely played by humans to be solved. The 
solution was the culmination of two decades of 
effort by a series of researchers (see Fig. 2), 
beginning with the exact solution of simplified 
poker games, and proceeding to the approxi- 
mate solution of abstracted versions of the full 
game (24). Computing the approximate Nash 
equilibrium of the full game required massive 
computation and new methods for equilibrium 
search based on regret-matching techniques from 
machine learning. The result is a strategy against 
which even a perfect opponent cannot earn a 
detectable profit. 

In general, the optimal strategy against perfect 
opponents may not be the ideal strategy against 
the more typical fallible kind. Despite consider- 
able effort, however, researchers have not found 
poker algorithms that perform considerably bet- 
ter than game-theoretic solutions, even against 
natural distributions of opponents. It has also 
turned out that game-theoretic approaches have 
been more successful than alternatives, even for 


B Mediated Prisoner's dilemma 


Mediator Defect 


Cooperate 


Mediator 


Cooperate 


Defect 


Fig. 3. Each entry gives the utility to (row player, column player). (A) Prisoner's dilemma. The 
dominant strategy equilibrium is (Defect, Defect). (B) Mediated prisoner's dilemma. The dominant 


strategy equilibrium is (Mediator, Mediator). 


In the rest of this section, we describe fur- 
ther examples in which economic modeling, in 
the form of game-theoretic algorithms, has pro- 
vided an effective way for Als to reason about 
other agents. The first example is computer poker. 
Although poker is an artificial game, many humans 
have invested a great deal of time and money 
to develop their playing skills. More important, 
poker’s uncertainty and complexity have made 
it a compelling challenge problem for AI tech- 
niques. Early approaches aimed to capture the 
knowledge of expert human players (22), but 
over the past decade, game-theoretic algorithms 
have predominated. Technically, poker is a game 
of imperfect information, where each player 
knows elements of history (cards dealt to them) 
that are secret from others. As uncertainty gets 
partially resolved over time, through card turns 
and betting, players must update their beliefs 
about both card outcomes and the beliefs of 
others. 

A major milestone in computer poker was 
achieved in 2014 with the effective solution of 
“heads up limit hold’em” (HULHE), which is a 
standard two-player version of the most popular 
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poker variants that are far from being exactly solved, 
such as no-limit (where bets are unrestricted) 
(25), or games with three or more players (26). 

Much of the interest in game-theoretic rea- 
soning for AI is driven by its applicability to real- 
world problems. The most prominent area of 
application in recent years, and our second exam- 
ple, is that of security games, based on a pioneer- 
ing series of systems developed by Tambe et ai. 
(27). In these systems, an agent decides how to 
defend facilities (e.g., airport security through 
placement of checkpoints) by solving a game 
where an attacker is presumed to rationally plan 
in response to the defender’s decision. This ap- 
proach has been successfully deployed in a variety 
of domains, including airport and airline sec- 
urity and coast guard patrols. 

As for any game-theoretic approach, the rec- 
ommendations from these systems are sensitive 
to assumptions made about the other agents 
(here, attackers): their respective preferences, 
beliefs, capabilities, and level of rationality. Rep- 
resentational approaches from AI provide flexi- 
bility, allowing the assumptions made in the strict 
versions typically employed by game theorists to 


be relaxed (28). The field of behavioral game 
theory has developed detailed predictive models 
based on how humans have been observed to 
deviate from game-theoretic rationality (29). Such 
predictive models can be readily incorporated in 
existing game-theoretic reasoning algorithms, as 
has been demonstrated in the context of modeling 
attackers in security games (30). An interesting 
open question is whether the kinds of behavioral 
models that best explain human decision-making 
[see Wright and Leyton-Brown (37) for a meta- 
study] will also prove effective in capturing the 
bounded rationality of computational agents. 


Designing multi-agent systems 


At the multi-agent level, a designer cannot di- 
rectly program behavior of the Als but instead 
defines the rules and incentives that govern 
interactions among Als. The idea is to change 
the “rules of the game” (e.g., rewards associated 
with actions and outcomes) to effect change in 
agent behavior and achieve system-wide goals. 
System goals might include, for instance, pro- 
moting an allocation of resources to maximize 
total value, coordinating behavior to complete a 
project on time, or pooling decentralized infor- 
mation to form an accurate prediction about a 
future event. The power to change the interac- 
tion environment is special and distinguishes 
this level of design from the standard AI design 
problem of performing well in the world as given. 

An interesting middle ground is to take the 
world as given but employ reliable entities— 
mediators—that can interact with Als and per- 
form actions on their behalf (32). Introducing 
mediating entities is relatively straightforward 
in the new AI economy. To see how this can be 
powerful, consider a mediated extension of the 
classic prisoner’s dilemma game (Fig. 3). If both 
Als grant the mediator the authority to play on 
their behalf (i.e., proxy right), it performs Co- 
operate on behalf of both agents. However, if 
only one AI grants the mediator proxy, it per- 
forms Defect on behalf of that agent. In equi- 
librium, both Als grant proxy, and the effect is 
to change the outcome from (Defect, Defect) to 
(Cooperate, Cooperate), increasing utility to both 
participants. 

For the more general specification of rules of 
interaction for rational agents, economics has a 
well-developed mathematical theory of mecha- 
nism design (33). The framework of mechanism 
design has been fruitfully applied, for example, to 
the design of matching markets (34) and auc- 
tions (35). Mechanism design is a kind of inverse 
game theory, with the rules inducing a game and 
the quality of the system evaluated in an equi- 
librium. In the standard model, design goals are 
specified in terms of agent preferences on out- 
comes, but these preferences are private and the 
agents are self-interested. A mechanism is a trusted 
entity, able to receive messages from agents that 
make claims (perhaps untruthfully) about prefer- 
ences and select an outcome (e.g., an allocation of 
resources or a plan of behavior) on the basis of 
these messages. The challenge is to align incentives 
and promote truthful reports. 
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Varian (36) has argued that the theory of 
mechanism design may actually prove more rel- 
evant for artificial agents than for human agents, 
because Als may better respect the idealized 
assumptions of rationality made in this frame- 
work. For example, one desirable property of a 
mechanism is incentive compatibility, which stip- 
ulates that truthful reports constitute an equi- 
librium. Sometimes it is even possible to make 
truthful reporting a dominant strategy (optimal 
whatever others do), achieving the strong prop- 
erty of strategy-proofness (37). It seems, however, 
that people do not reliably understand this pro- 
perty; evidence from medical matching markets, 
and also from laboratory experiments, suggests 
that some participants in strategy-proof matching 
mechanisms try to misrepresent their preferences 
even though it provides no advantage (38, 39). 

For artificial systems, in comparison, we might 
expect Als to be truthful where this is optimal 
and to avoid spending computation reasoning 
about the behavior of others where this is not 
useful (5). More generally, mechanism designs 
for AI systems need not be simple because they 
need not be understandable to people. On the 
contrary, AI techniques such as preference rep- 
resentation, preference elicitation, and search 
algorithms can be used to turn the mathematical 
formalisms of mechanism design into concrete 
computational methods (40-42). The design prob- 
lem itself can also be usefully formulated as a 
computational problem, with optimization and 
machine learning used to find solutions to de- 
sign problems for which analytical solutions are 
unavailable (43-46). 

The prospect of an economy of Als has also 
inspired expansions to new mechanism design 
settings. Researchers have developed incentive- 
compatible multiperiod mechanisms, consider- 
ing such factors as uncertainty about the future 
and changes to agent preferences because of 
changes in local context (47-49). Another direc- 
tion considers new kinds of private inputs be- 
yond preference information (50, 57). For example, 
in a team formation setting, each AI might mis- 
report information about the capabilities of other 
Als in order to get itself selected for the team (52). 
Similarly, Als seeking to maximize task assign- 
ments might provide false reports of experience 
in task performance in order to mislead a learn- 
ing mechanism constructing an automatic task 
classifier (53). Systems of AIs can also create new 
challenges for mechanism design. One such chal- 
lenge is false-name bidding, where an AI exploits 
its ability to manage multiple identities. For ex- 
ample, it may gain resources more cheaply by 
dividing a request into a set of smaller requests, 
each placed from a different identity under its 
control. In response, researchers have developed 
mechanisms that are robust to this new kind of 
attack (54). 

The important role of mechanism design in an 
economy of Als can be observed in practice. Search 
engines run auctions to allocate ads to positions 
alongside search queries. Advertisers bid for their 
ads to appear in response to specific queries (e.g., 
“personal injury lawyer”). Ads are ranked according 
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to bid amount (as well as other factors, such as 
ad quality), with higher-ranked ads receiving a 
higher position on the search results page. Early 
auction mechanisms employed first-price rules, 
charging an advertiser its bid amount when its 
ad receives a click. Recognizing this, advertisers 
employed AIs to monitor queries of interest, 
ordered to bid as little as possible to hold onto 
the current position. This practice led to cascades 
of responses in the form of bidding wars, amounting 
to a waste of computation and market ineffi- 
ciency (55). To combat this, search engines in- 
troduced second-price auction mechanisms (37), 
which charge advertisers based on the next-highest 
bid price rather than their own price. This ap- 
proach (a standard idea of mechanism design) 
removed the need to continually monitor the bid- 
ding to get the best price for position, thereby end- 
ing bidding wars (56). 

In recent years, search engine auctions have 
supported richer, goal-based bidding languages. 


SP + Proxy 


The tangle between automated agents and the 
design of rules of interaction also features prom- 
inently in today’s financial markets, where the 
dominance of computerized traders has, by most 
accounts, qualitatively shaped the behavior of 
these markets. Although details of implementa- 
tion are closely held secrets, it is well understood 
that techniques from AI and machine learning 
are widely employed in the design and analysis 
of algorithmic traders (66). Algorithmic trading 
has enabled the deployment of strategies that 
exploit speed advantages and has led in turn to a 
costly arms race of measures to respond to market 
information with minimum latency. A proposed 
design response would replace continuous-time 
auctions with periodic auctions that clear on the 
order of once per second, thus negating the 
advantage of tiny speed improvements (67, 68). 

We describe two additional examples of the 
design of multi-agent systems for an economy 
of Als. The first example system aggregates 


Fig. 4. Two generations of sponsored search mechanisms. Early designs were first price (FP), and 
advertisers (ADV) used Als (Al-POS) to maintain a position on the list of search results at the lowest 
possible price. Second-price (SP) auction mechanisms were introduced, designed to replace the 
combination of FP and Al-POS. Advertisers adopted new Als (AIl-GOAL) to achieve higher-level goals 
such as to maximize profit or to maximize the number of clicks. The second price auction was extended to 
include proxy agents (SP+Proxy), designed to replace the combination of SP and Al-GOAL. 


For example, an advertiser can ask to maximize 
clicks over a weighted set of queries subject to a 
budget constraint (57, 58). Search engines pro- 
vide proxy agents that then bid on behalf of 
advertisers to achieve the stated goal (59). This 
introduction of proxy agents and the earlier 
switch from first price to second price can be 
interpreted as a computational application of a 
fundamental concept in mechanism design—the 
revelation principle (60-62). Briefly, this states 
that if the rules of a mechanism and the equil- 
ibrium strategies in that mechanism are replaced 
by a new mechanism that is functionally equiv- 
alent to the composition of these rules and strat- 
egies, then the new mechanism will be incentive 
compatible. Although neither redesign provides 
incentive compatibility in a formal sense, both 
second-pricing and proxy bidding can be inter- 
preted as accomplishing on behalf of advertisers 
what they were doing (through Als) in an earlier 
design (see Fig. 4). Still other ad platform de- 
signs are using a strategy-proof mechanism [the 
Vickrey-Clarke-Groves mechanism (37, 63, 64)] 
to make decisions about the space to allocate to 
ads, which ads to allocate, and which (nonspon- 
sored) content to display to a user (65). 


information held by multiple Als. The rules of a 
system that achieves this goal can be engineered 
purposefully through the design of a prediction 
market (69). Popular versions of prediction mar- 
kets feature questions such as who will be elected 
US. president (e.g., Betfair offers many such mar- 
kets). The basic idea of a prediction market is 
to facilitate trade in securities contracts (e.g., a 
possible contract will pay $1 if Hilary Clinton is 
elected). The price that balances supply and de- 
mand is then interpreted as a market prediction 
(e.g., price $0.60 reflects probability 0.6 for the 
payoff event). 

Consider a domain with a large number of in- 
terrelated random variables—for example, “flight 
BA214 delayed by more than 1 hour,” “snowstorm 
in Boston,” “de-icing machine fail,” “incoming 
flight BA215 delayed by more than 1 hour,” and 
“security alert in London.” In a combinatorial 
prediction market (70), a large bet on the con- 
tract “de-icing machine fail” would affect the 
price of “flight BA214 delayed by more than 1 hour” 
and all other connected events. A challenge is that 
the number of conceivable events is exponential in 
the number of random variables. Among other 
properties, a good market design should allow bets 
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Fig. 5. Ina reputation system for a multi-agent Al, each agent chooses an action, and the combined 
effect of these actions generates rewards (i.e., utility). Based on the actions taken and the rewards 
received, agent i can submit a report, xi, to the reputation system. The reputation system aggregates this 
feedback—for example, providing a ranked list to reflect the estimated trustworthiness of agents. Each 
agent observes this ranked list, and this information may influence future actions. 


on all events about which Als have information 
(e.g., “de-icing machine fail AND all subsequent 
flights from Boston delayed by more than 1 hour”). 
A good design should also align incentives—for 
example, making it utility-maximizing to trade im- 
mediately on current information until the market 
price reflects an agent’s belief. Progress in scaling 
up combinatorial markets has been made by rela- 
ting the problem of pricing bets to well-understood 
problems in statistical inference and convex opti- 
mization (71, 72). Related research advances are 
being made by allowing Als to transact in hypo- 
theses that are acquired through machine learning 
as well as trade directly in information signals 
rather than beliefs (73-75). 

The second example is the management of 
information concerning the trustworthiness of 
agents within an economy of Als. Trust that a 
counterparty will complete a transaction or in- 
vest effort or resources is crucial for any well- 
functioning economic system. A standard approach 
is to associate participants with a reputation, 
which can serve to align incentives in the present 
under the threat of a damaged reputation and 
lost opportunities in the future. In addition to 
this problem of moral hazard (ie., will agents 
behave cooperatively when completing economic 
transactions), reputation systems can address the 
problem of adverse selection (i.e., will high-quality 
agents choose to enter a market in the first place) 
(76, 77). 

A special challenge in an economy of Als arises 
because of the fluidity of identity and the ease 
with which agents can be replaced. This raises, 
for example, the specter of whitewashing attacks, 
where an AI repeatedly runs down its reputation 
before reentering with a different identity. With- 
out the possibility of enforcing strong identities 
that cannot be changed, this suggests a social 
cost of fluid identities, where it becomes neces- 


SCIENCE sciencemag.org 


sary to impose a penalty on all new participants 
and make them build up reputations from an 
assumption of being untrustworthy (78). 

We should also consider that machina eco- 
nomicus will be strategic in sharing feedback on 
other Als. For example, in eBay’s original repu- 
tation system, buyers were often reluctant to 
leave negative feedback about deadbeat sellers, 
because the sellers could retaliate with negative 
feedback about the buyer. In response, eBay 
introduced an additional feedback mechanism 
that was one-directional from the buyer to the 
seller and could not be easily traced to a parti- 
cular buyer. The change resulted in a greater 
amount of negative feedback (79). 

The economy of Als also offers positive oppor- 
tunities for promoting trust through bookkeeping, 
collecting feedback, and tracking the provenance 
of feedback in novel reputation mechanisms (see 
Fig. 5). AI researchers are designing reputation 
systems that align incentives with making truth- 
ful reports, while provably satisfying axiomatic 
properties such as symmetry: Two agents that 
are in an equivalent position from the perspec- 
tive of reports made and received should have 
the same trust score (80, 87). Another example 
is the design of accounting systems that elicit 
truthful reports about the resources contributed 
or work performed by other Als and enable the 
design of systems to mitigate free-riding and 
promote fair contributions to an economic sys- 
tem (82). Still, the extent to which effective, multi- 
agent AIs can be developed entirely through 
computational infrastructure such as reputation 
mechanisms and without recourse to legal sys- 
tems remains an interesting open question. 


Closing comments 


Whatever one’s thoughts about when or whether 
AI will transcend human-level performance, the 


rapidly advancing capabilities of AI are fueling 
considerable optimism and investment in AI 
research. AI has surpassed or will likely soon 
surpass humans in narrow domains such as 
playing chess, controlling a jumbo jet during 
cruise, making product recommendations, pric- 
ing millions of products on an eCommerce plat- 
form, reasoning about whether a patient is likely 
to be re-admitted to a hospital, and detecting sig- 
nals from a massive volume of financial news 
stories. 

Certainly, many fundamental challenges re- 
main, including how to design reasoning and 
inference methods that effectively balance the 
benefit of additional computation with the costs 
that may arise from additional delay to acting 
in the world and how to design AI systems that 
can learn and generalize from reward signals 
in unconstrained domains. Given that decision 
problems related to economic transactions are 
often relatively well structured, however, it seems 
likely to us that AI will continue to make espe- 
cially rapid inroads in economically important 
applications. This in turn will ensure continued 
effort on methods for rational, economic reason- 
ing toward the broader goal of developing ma- 
china economicus. 

We should not leave the impression that AI 
researchers unanimously embrace economic per- 
spectives on single- or multi-agent AI. For some, 
multi-agent economic models are still seen as a 
distraction. After all, a centralized perspective 
allows focusing on overall goals without worry- 
ing about the incentives of individual parts of 
the system. Others conduct research into multi- 
agent systems composed of agents under the 
control of the designer, so that they can be pro- 
grammed in any way desired. Just as with cen- 
tralized solutions, these so-called “cooperative” 
multi-agent systems allow design without con- 
cern for the self-interest of individual agents, 
albeit often with decomposition or communi- 
cation constraints. But cooperative versus self- 
interested is really a difference in assumptions 
on the power of a system designer, rather than 
a technical dispute. The viewpoint that we as- 
cribe to is that a large number of AI systems will, 
given the existing structure of human economic 
systems, be populated by Als that are designed, 
deployed, owned, and operated by a myriad of 
different parties, each with possibly misaligned 
goals. Finally, some may object to the economic 
approach on the basis that Als are and will re- 
main far from perfectly rational, simply by vir- 
tue of physical and computational limits. More 
direct models of the Als’ computational behav- 
ior, in terms of the automata they are, could in 
principle be more accurate. The analytical util- 
ity of a rationality abstraction for Als is ulti- 
mately an empirical question to be resolved as 
AI progresses. 

Among those adopting an economic approach, 
there persist some disagreements on specific 
techniques—for example, on the role of equilib- 
rium reasoning. Even if agents can be viewed 
as rational, some question whether it is plausi- 
ble that they reach equilibrium configurations, 
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particularly in situations where multiple equilib- 
ria exist. As Shoham (83) argues, game theory 
lacks a well-accepted pragmatic account of how 
it should be deployed in concrete reasoning con- 
texts. A positive view is that AI researchers, in 
their efforts to operationalize economic reason- 
ing, are developing exactly this needed body of 
pragmatics. 

Some may object that mechanism design is too 
idealized even for systems of AIs—for example, in 
its insistence on design under equilibrium behav- 
ior, its assumption that rules of interaction can be 
designed from scratch, and its lack of attention to 
the details of the human and legal contexts in 
which designed systems will operate. A positive 
view is that AI systems are precisely the kinds of 
environments where we can build tabula rasa 
new rules of interaction, because these rules will 
be realized through the Internet and as programs 
running on computer servers. That such rules of 
interaction can come into existence is as much a 
matter of science and engineering as it is of 
public policy. 

As AI advances, we are confident that econo- 
mic reasoning will continue to have an impor- 
tant role in the design of single-agent and 
multi-agent AIs, and we have argued that, as 
economies of Als continue to emerge, there will 
need to be a new science to understand how to 
design these systems. These Als will no doubt 
exert strong forces on the economy and broader 
society; understanding the effect and extent of 
this will shape the research agendas of both AI 
and economics in years to come. 


REFERENCES AND NOTES 


1. S. Russell, P. Norvig, Artificial Intelligence: A Modern Approach 
(Prentice Hall, ed. 3, 2009). 

2. J. Doyle, Comput. Intell. 8, 376-409 (1992). 

3. E. J. Horvitz, Computation and action under bounded 
resources, thesis, Stanford University (1990). 

4. S. Russell, in Fundamental Issues of Artificial Intelligence, 

V. C. Muller, Ed. (Springer, Berlin, 2015). 
5. J. S. Rosenschein, G. Zlotkin, Rules of Encounter: Designing 
Conventions for Automated Negotiation among Computers 
(MIT Press, Cambridge, 1994). 
6. M. McCue, Can't secure a reservation to a hot restaurant? Try 
using an algorithm, Fortune, 3 September 2014; http:// 
fortune.com/2014/09/03/opentable-dinematic-tablesweep- 
restaurant-reservation-service/. 
7. J.D. Sutter, CNN, 25 April 2011; http://edition.cnn.com/2011/ 
TECH/web/04/25/amazon.price.algorithm. 
8. T. Hendershott, C. M. Jones, A. J. Menkveld, J. Finance 66, 
1-33 (2011). 
9. D. J. Abraham, A. Blum, T. Sandholm, 8th ACM Conference on 
Electronic Commerce (2007), pp. 295-304. 
0. A. Frechette, N. Newman, K. Leyton-Brown, 24th International 
Joint Conference on Artificial Intelligence (2015). 

1. S. D. Ramchurn, P. Vytelingum, A. Rogers, N. R. Jennings, 
Commun. ACM 55, 86-97 (2012). 

2. R. |. Brafman, M. Tennenholtz, Artif. Intell. 159, 27-47 
(2004). 

3. A. X. Jiang et al., Adv. Neural Inf. Process. Syst. 27, 2573-2581 
(2014). 

A. A. Y. Ng, D. Harada, S. J. Russell, 16th International Conference 
on Machine Learning (1999), pp. 278-287. 

5. S. Singh, R. L. Lewis, A. G. Barto, J. Sorg, IEEE Transactions on 
Autonomous Mental Development 2, 70-82 (2010). 

6. L. Getoor, B. Taskar, Eds., Introduction to Statistical Relational 
Learning (MIT Press, Cambridge, 2007). 

7. M. |. Jordan, T. M. Mitchell, Science 349, 255-260 
(2015). 

8. J. Hirschberg, C. D. Manning, Science 349, 261-266 
(2015). 


272 17JULY 2015 + VOL 349 ISSUE 6245 


19. 


20. 


2. 


22. 


23. 


24. 


25. 


26. 


27. 


28. 
29. 


30. 


31. 


32. 
33: 


34. 


35. 
36. 
37. 


38. 
39. 


40. 


4). 


42. 
43. 


4A. 


45. 


46. 


47. 


48. 


49. 


50. 


51. 


52. 


53. 


54. 


Y. Shoham, K. Leyton-Brown, Multiagent Systems: Algorithmic, 
Game-Theoretic and Logical Foundations (Cambridge Univ. 
Press, Cambridge, 2009). 

D. Fudenberg, D. K. Levine, The Theory of Learning in Games 
(MIT Press, Cambridge, 1998). 

M. P. Wellman, D. M. Reeves, K. M. Lochner, Y. Vorobeychik, 
J. Artif. Intell. Res. 21, 19-36 (2004). 

D. Billings, A. Davidson, J. Schaeffer, D. Szafron, Artif. Intell. 
134, 201-240 (2002). 

M. Bowling, N. Burch, M. Johanson, 0. Tammelin, Science 347, 
45-149 (2015). 

. Sandholm, Al Mag. 31, 13-32 (2010). 

N. Brown, S. Ganzfried, T. Sandholm, 14th International 
Conference on Autonomous Agents and Multi-Agent Systems 
(2015), pp. 7-15. 
R. Gibson, Regret minimization in games and the development 
of champion multiplayer computer poker-playing agents, 
Ph.D. thesis, University of Alberta (2014). 
M. Tambe, Security and Game Theory: Algorithms, Deployed 
Systems, Lessons Learned (Cambridge Univ. Press, Cambridge, 
2011). 
Y. Gal, A. Pfeffer, J. Artif. Intell. Res. 33, 109-147 (2008). 

C. F. Camerer, Behavioral Game Theory: Experiments in 
Strategic Interaction (Princeton Univ. Press, Princeton, 

2003). 

R. Yang, C. Kiekintveld, F. Orddfiez, M. Tambe, R. John, Artif. 
Intell. 195, 440-469 (2013). 

. R. Wright, K. Leyton-Brown, Evaluating, understanding, and 
improving behavioral game theory models for predicting 
human behavior in unrepeated normal-form games, CoRR abs/ 
306.0918 (2013); http://arxiv.org/abs/1306.0918. 

D. Monderer, M. Tennenholtz, Artif. Intell. 173, 180-195 (2009). 
L. Hurwicz, in Mathematical Methods in the Social Sciences, 

K. J. Arrow, S. Karlin, P. Suppes, Eds. (Stanford University 
Press, Stanford 1960), Ch. 3, pp. 27-46. 

T. Sénmez, U. Unver, in Handbook of Social Economics, A. Bisin, 
J. Benhabib, M. Jackson, Eds. (North-Holland, 2011), vol. 1A, 

pp. 781-852. 

P. Milgrom, Putting Auction Theory to Work (Cambridge Univ. 
Press, Cambridge, 2004). 

H. R. Varian, Ist USENIX Workshop on Electronic Commerce 
(1995), pp. 13-21. 

W. Vickrey, J. Finance 16, 8-37 (1961). 

Y. Chen, T. Sénmez, J. Econ. Theory 127, 202-231 (2006). 

F. Echenique, A. J. Wilson, L. Yariv, Clearinghouses for 
wo-sided matching: An experimental study, Working Papers 
487, University of Pittsburgh, Department of Economics 
(2013). 
N. Nisan, in Combinatorial Auctions, P. Cramton, 

Y. Shoham, R. Steinberg, Eds. (MIT Press, Cambridge, 

2006), chap. 9. 

T. Sandholm, C. Boutilier, in Combinatorial Auctions, P. Cramton, 
Y. Shoham, R. Steinberg, Eds. (MIT Press, Cambridge, 2006), 
chap. 10. 
T. Sandholm, Artif. Intell. 135, 1-54 (2002). 

V. Conitzer, T. Sandholm, 18th Conference on Uncertainty in 
Artificial Intelligence (2002), pp. 103-110. 

S. Alaei, H. Fu, N. Haghpanah, J. D. Hartline, A. Malekian, 13th 
ACM Conference on Electronic Commerce (2012), p. 17. 

Y. Cai, C. Daskalakis, S. M. Weinberg, 54th Annual IEEE 
Symposium on Foundations of Computer Science (2013), 

pp. 618-627. 

P. Duetting et al., ACM Transactions on Economics and 
Computation 3, 5:1-5:41 (2015). 

D. C. Parkes, S. Singh, Adv. Neural Inf. Process. Syst. 16, 
791-798 (2003). 

R. Cavallo, D. C. Parkes, S. Singh, 22nd Conference on 
Uncertainty in Artificial Intelligence (Cambridge, MA, 2006), 
pp. 55-62. 

D. C. Parkes, in Algorithmic Game Theory, N. Nisan, 

. Roughgarden, E. Tardos, V. Vazirani, Eds. (Cambridge Univ. 
Press, 2007), chap. 16, pp. 411-439. 

Y. Shoham, M. Tennenholtz, Theor. Comput. Sci. 343, 97-113 
(2005). 

J. Y. Halpern, V. Teague, 36th Annual ACM Symposium on 
heory of Computing (2004), pp. 623-632. 

N. Alon, F. Fischer, A. Procaccia, M. Tennenholtz, 13th 
Conference on Theoretical Aspects of Rationality and 
Knowledge (2011), pp. 101-110. 

O. Dekel, F. A. Fischer, A. D. Procaccia, J. Comput. Syst. Sci. 
76, 759-777 (2010). 

M. Yokoo, Y. Sakurai, S. Matsubara, Games Econ. Behav. 46, 
174-188 (2004). 


55. 


56. 


oT: 


58. 


59. 


60. 


61. 


62. 
63. 
64. 
65. 


66. 


67. 


68. 


69. 


70. 


71. 


72. 


73. 


74. 


75. 


76. 


77. 


78. 


79. 


80. 


81. 


82. 


83. 
84. 


85. 
86. 


87. 


88. 


89. 


90. 


ACKNOWLEDGMENTS 


B. Edelman, M. Ostrovsky, Decis. Support Syst. 43, 192-198 
(2007). 
B. Edelman, M. Ostrovsky, M. Schwarz, Am. Econ. Rev. 97, 
242-259 (2007). 
C. Borgs et al., 16th International Conference on World-Wide 
Web (2007), pp. 531-540. 

J. Feldman, S. Muthukrishnan, M. Pal, C. Stein, 8th 

ACM Conference on Electronic Commerce (2007), 

pp. 40-49. 
A. Z. Broder, E. Gabrilovich, V. Josifovski, G. Mavromatis, 

A. J. Smola, 4th International Conference on Web Search 

and Web Data Mining (2011), pp. 515-524. 

L. Hurwicz, in Decision and Organization, R. Radner, 

C. B. McGuire, Eds. (North-Holland, Amsterdam, 1972), 

Ch. 14, pp. 297-336. 

A. Gibbard, Econometrica 41, 587-602 (1973). 

R. Myerson, Econometrica 47, 61-73 (1979). 

E. Clarke, Public Choice 11, 17-33 (1971). 

T. Groves, Econometrica 41, 617-631 (1973). 

H. R. Varian, C. Harris, Am. Econ. Rev. 104, 442-445 
(2014). 

M. Kearns, Y. Nevmyvaka, in High Frequency Trading: 

New Realities for Traders, Markets and Regulators, D. Easley, 
M. Lopez de Prado, M. O'Hara, Eds. (Risk Books, London, 
2013), Ch. 5, pp. 91-124. 

E. Budish, P. Cramton, J. Shim, The high-frequency trading 
arms race: Frequent batch auctions as a market design 
response, Tech. Rep. 14-03, Booth School of Business, 
University of Chicago (2015). 

E. Wah, M. P. Wellman, 14th ACM Conference on Electronic 
Commerce (2013), pp. 855-872. 

R. Forsythe, F. Nelson, G. R. Neumann, J. Wright, in 
Contemporary Laboratory Experiments in Political Economy, 

T. R. Palfrey, Ed. (University of Michigan Press, Ann Arbor, 
1991), pp. 69-111. 

R. D. Hanson, Inf. Syst. Front. 5, 107-119 (2003). 

J. Abernethy, Y. Chen, J. W. Vaughan, ACM Transactions on 
Economics and Computation 1, 12:1-12:39 (2013). 

M. Dudik, S. Lahaie, D. M. Pennock, D. Rothschild, 14th 

ACM Conference on Electronic Commerce (2013), 

pp. 341-358. 

J. Abernethy, R. Frongillo, Adv. Neural Inf. Process. Syst. 24, 
2600-2608 (2011). 

J. Witkowski, D. C. Parkes, 13th ACM Conference on Electronic 
Commerce (2012), pp. 964-981. 

J. Hu, A. J. Storkey, 31st International Conference on Machine 
Learning (2014), pp. 1773- 1781. 

P. Resnick, K. Kuwabara, R. Zeckhauser, E. J. Friedman, 
Commun. ACM 43, 45-48 (2000). 

C. Dellarocas, Manage. Sci. 49, 1407-1424 (2003). 

E. J. Friedman, P. Resnick, J. Econ. Manage. Strategy 10, 
173-199 (2001). 

G. Bolton, B. Greiner, A. Ockenfels, Manage. Sci. 59, 265-285 
(2013). 

A. Cheng, E. Friedman, ACM SIGCOMM Workshop on 
Economics of Peer-to-Peer Systems (2005), pp. 128-132. 

A. Altman, M. Tennenholtz, J. Artif. Intell. Res. 31, 473-495 
(2008). 

S. Seuken, J. Tang, D. C. Parkes, 24th AAAI Conference on 
Artificial Intelligence (2010), pp. 860-866. 

Y. Shoham, Commun. ACM 51, 74-79 (2008). 

J. Sorg, S. Singh, R. Lewis, 27th International Conference on 
Machine Learning (2010), pp. 1007-1014. 

D. Koller, A. Pfeffer, Artif. Intell. 94, 167-215 (1997). 

D. Billings et al., 18th International Joint Conference on 
Artificial Intelligence (2003), pp. 661-668. 

A. Gilpin, T. Sandholm, 7th ACM Conference on Electronic 
Commerce (2006), pp. 160-169. 

M. Zinkevich, M. Johanson, M. Bowling, C. Piccione, Adv. Neural 
Inf. Process. Syst. 20, 905-912 (2007). 

A. Gilpin, S. Hoda, J. Pefia, T. Sandholm, 3rd 

International Workshop on Internet and Network 

Economics (2007), pp. 57-69. 
S. Hoda, A. Gilpin, J. Pefia, T. Sandholm, Math. Oper. Res. 35, 
494-512 (2010). 


Thanks to the anonymous referees and to M. Bowling, Y. Chen, 
K. Gal, S. Lahaie, D. Pennock, A. Procaccia, T. Sandholm, 


S. Seuken, Y. Shoham, A. Storkey, M. 


ambe, and M. Tennenholtz 


for their thoughtful comments on an earlier draft. 


10.1126/science.aaa8403 


sciencemag.org SCIENCE 


REVIEW 


Computational rationality: A 
converging paradigm for intelligence 
in brains, minds, and machines 


Samuel J. Gershman,” Eric J. Horvitz,?* Joshua B. Tenenbaum** 


After growing up together, and mostly growing apart in the second half of the 20th century, 
the fields of artificial intelligence (Al), cognitive science, and neuroscience are 
reconverging on a shared view of the computational foundations of intelligence that 
promotes valuable cross-disciplinary exchanges on questions, methods, and results. 

We chart advances over the past several decades that address challenges of perception 
and action under uncertainty through the lens of computation. Advances include the 
development of representations and inferential procedures for large-scale probabilistic 
inference and machinery for enabling reflection and decisions about tradeoffs in effort, 
precision, and timeliness of computations. These tools are deployed toward the goal of 
computational rationality: identifying decisions with highest expected utility, while 

taking into consideration the costs of computation in complex real-world problems in 
which most relevant calculations can only be approximated. We highlight key concepts with 
examples that show the potential for interchange between computer science, cognitive 


science, and neuroscience. 


magine driving down the highway on your 
way to give an important presentation, when 
suddenly you see a traffic jam looming ahead. 
In the next few seconds, you have to decide 
whether to stay on your current route or take 
the upcoming exit—the last one for several miles— 
all while your head is swimming with thoughts 
about your forthcoming event. In one sense, this 
problem is simple: Choose the path with the 
highest probability of getting you to your event 
on time. However, at best you can implement 
this solution only approximately: Evaluating the 
full branching tree of possible futures with high 
uncertainty about what lies ahead is likely to be 
infeasible, and you may consider only a few of 
the vast space of possibilities, given the urgency 
of the decision and your divided attention. How 
best to make this calculation? Should you make a 
snap decision on the basis of what you see right 
now, or explicitly try to imagine the next several 
miles of each route? Perhaps you should stop 
thinking about your presentation to focus more 
on this choice, or maybe even pull over so you 
can think without having to worry about your 
driving? The decision about whether to exit has 
spawned a set of internal decision problems: how 
much to think, how far should you plan ahead, 
and even what to think about. 
This example highlights several central themes 
in the study of intelligence. First, maximizing some 
measure of expected utility provides a general- 
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purpose ideal for decision-making under uncer- 
tainty. Second, maximizing expected utility is 
nontrivial for most real-world problems, necessi- 
tating the use of approximations. Third, the choice 
of how best to approximate may itself be a decision 
subject to the expected utility calculus—thinking 
is costly in time and other resources, and some- 
times intelligence comes most in knowing how 
best to allocate these scarce resources. 

The broad acceptance of guiding action with 
expected utility, the complexity of formulating 
and solving decision problems, and the rise of 
approximate methods for multiple aspects of 
decision-making under uncertainty has motivated 
artificial intelligence (AI) researchers to take a 
fresh look at probability through the lens of com- 
putation. This examination has led to the devel- 
opment of computational representations and 
procedures for performing large-scale probabil- 
istic inference; methods for identifying best ac- 
tions, given inferred probabilities; and machinery 
for enabling reflection and decision-making about 
tradeoffs in effort, precision, and timeliness of 
computations under bounded resources. Analo- 
gous ideas have come to be increasingly important 
in how cognitive scientists and neuroscientists 
think about intelligence in human minds and 
brains, often being explicitly influenced by AI re- 
searchers and sometimes influencing them back. 
In this Review, we chart this convergence of ideas 
around the view of intelligence as computational 
rationality: computing with representations, algo- 
rithms, and architectures designed to approximate 
decisions with the highest expected utility, while 
taking into account the costs of computation. We 
share our reflections about this perspective on 
intelligence, how it encompasses interdiscipli- 
nary goals and insights, and why we think it will 
be increasingly useful as a shared perspective. 


Models of computational rationality are built 
on a base of inferential processes for perceiving, 
predicting, learning, and reasoning under uncer- 
tainty (J-3). Such inferential processes operate on 
representations that encode probabilistic depen- 
dencies among variables capturing the likelihoods 
of relevant states in the world. In light of incom- 
ing streams of perceptual data, Bayesian updating 
procedures or approximations are used to prop- 
agate information and to compute and revise 
probability distributions over states of variables. 
Beyond base processes for evaluating probabil- 
ities, models of computational rationality require 
mechanisms for reasoning about the feasibility 
and implications of actions. Deliberation about 
the best action to take hinges on an ability to make 
predictions about how different actions will in- 
fluence likelihoods of outcomes and a considera- 
tion of the value or utilities of the outcomes (4). 
Learning procedures make changes to parame- 
ters of probabilistic models so as to better explain 
perceptual data and provide more accurate in- 
ferences about likelihoods to guide actions in 
the world. 

Last, systems with bounded computational 
power must consider important tradeoffs in the 
precision and timeliness of action in the world. 
Thus, models of computational rationality may 
include policies or deliberative machinery that 
make inferences and decisions at the “metalevel” 
in order to regulate base-level inferences. These 
decisions rely on reflection about computational 
effort, accuracy, and delay associated with the 
invocation of different base-level algorithms in dif- 
ferent settings. Such metalevel decision-making, 
or “metareasoning,” can be performed via real- 
time reflection or as policies computed during 
offline optimizations. Either way, the goal is to 
identify configurations and uses of base-level pro- 
cesses with the goal of maximizing the expected 
value of actions taken in the world. These com- 
putational considerations become increasingly 
important when we consider richer representa- 
tions (graphs, grammars, and programs) that sup- 
port signature features of human intelligence, such 
as recursion and compositionality (5). 

Key advances in AI on computational machin- 
ery for performing inference, identifying ideal 
actions, and deliberating about the end-to-end 
operation of systems have synergies and reso- 
nances with human cognition. After a brief his- 
tory of developments in AI, we consider links 
between computational rationality and findings 
in cognitive psychology and neuroscience. 


Foundations and early history 


AI research has its roots in the theory of compu- 
tability developed in the 1930s. Efforts then high- 
lighted the power of a basic computing system 
(the Turing Machine) to support the real-world 
mechanization of any feasible computation (6). 
The promise of such general computation and the 
fast-paced rise of electronic computers fueled the 
imagination of early computer scientists about 
the prospect of developing computing systems 
that might one day both explain and replicate 
aspects of human intelligence (7, 8). 
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Fig. 1. Examples of modern Al systems that use approximate inference and decision-making. 
These systems cannot rely on exhaustive enumeration of all relevant utilities and probabilities. Instead, 
they must allocate computational resources (including time and energy) to optimize approximations 
for inferring probabilities and identifying best actions. (A) The internal state of IBM Watson as it 


[Permission from Microsoft] 


Early pioneers in AI reflected about uses of 
probability and Bayesian updating in learning, 
reasoning, and action. Analyses by Cox, Jaynes, 
and others had provided foundational argu- 
ments for probability as a sufficient measure 
for assessing and revising the plausibility of 
events in light of perceptual data. In influential 
work, von Neumann and Morgenstern published 
results on utility theory that defined ideal, or 
“rational,” actions for a decision-making agent 
(4). They presented an axiomatic formalization 
of preferences and derived the “principle of max- 
imum expected of utility” (MEU). Specifically, 
they showed that accepting a compact and com- 
pelling set of desiderata about preference order- 
ings implies that ideal decisions are those actions 
that maximize an agent’s expected utility, which is 
computed for each action as the average utility 
of the action when considering the probability 
of states of the world. 

The use of probability and MEU decision- 
making soon pervaded multiple disciplines, in- 
cluding some areas of AI research, such as projects 
in robotics. However, the methods did not gain 
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plays Jeopardy!, representing a few high-probability hypotheses. [Photo by permission of IBM News 
Room] (B) The internal state of the Google self-driving car, which represents those aspects of the world 
that are potentially most valuable or costly for the agent in the foreseeable future, such as the po- 
sitions and velocities of the self-driving car, other cars and pedestrians, and the state of traffic signals. 
[Reprinted with permission from Google] (C) The Assistant (left), an interactive automated secretary 
fielded at Microsoft Research, recognizes multiple people in its proximity (right); deliberates about their 
current and future goals, attention, and utterances; and engages in natural dialog under uncertainty. 


a large following in studies of AI until the late 
1980s. For decades after the work by von Neumann 
and Morgenstern, probabilistic and decision- 
theoretic methods were deemed by many in the 
AI research community to be too inflexible, sim- 
plistic, and intractable for use in understand- 
ing and constructing sophisticated intelligent 
systems. Alternative models were explored, in- 
cluding logical theorem-proving and various 
heuristic procedures. 

In the face of the combinatorial complexity 
of formulating and solving real-world decision- 
making, a school of research on heuristic mod- 
els of bounded rationality blossomed in the 
later 1950s. Studies within this paradigm include 
the influential work of Simon and colleagues, 
who explored the value of informal, heuristic 
strategies that might be used by people—as 
well as by computer-based reasoning systems— 
to cut through the complexity of probabilistic 
inference and decision-making (9). The per- 
spective of such heuristic notions of bounded 
rationality came to dominate a large swath of 
AI research. 


Computational lens on probability 

In the late 1980s, a probabilistic renaissance 
swept through mainstream AI research, fueled 
in part by pressures for performing sound in- 
ference about likelihoods of outcomes in appli- 
cations of machine reasoning to such high-stakes 
domains as medicine. Attempts to mechanize 
probability for solving challenges with inference 
and learning led to new insights about proba- 
bility and stimulated thinking about the role of 
related representations and inference strategies 
in human cognition. Perhaps most influentially, 
advances in AI led to the formulation of rich 
network-based representations, such as Bayesian 
networks, broadly referred to as probabilistic 
graphical models (PGMs) (J, 2). Belief updating 
procedures were developed that use parallel and 
distributed computation to update constellations 
of random variables in the networks. 

The study of PGMs has developed in numer- 
ous directions since these initial advances: effi- 
cient approximate inference methods; structure 
search over combinatorial spaces of network struc- 
tures; hierarchical models for capturing shared 
structure across data sets; active learning to 
guide the collection of data; and probabilistic 
programming tools that can specify rich, context- 
sensitive models via compact, high-level pro- 
grams. Such developments have put the notions 
of probabilistic inference and MEU decision- 
making at the heart of many contemporary AI 
approaches (3) and, together with ever-increasing 
computational power and data set availability, 
have been responsible for dramatic AI successes 
in recent years (such as IBM’s Watson, Google’s 
self-driving car, and Microsoft’s automated as- 
sistant). These developments also raise new com- 
putational and theoretical challenges: How can 
we move from the classical view of a rational agent 
who maximizes expected utility over an exhaus- 
tively enumerable state-action space to a theory 
of the decisions faced by resource-bounded AI 
systems deployed in the real world (Fig. 1), which 
place severe demands on real-time computation 
over complex probabilistic models? 


Rational decisions under bounded 
computational resources 


Perception and decision-making incur computa- 
tional costs. Such costs may be characterized in 
different ways, including losses that come with 
delayed action in time-critical settings, interfer- 
ence among multiple inferential components, 
and measures of effort invested. Work in AI has 
explored the value of deliberating at the meta- 
level about the nature and extent of perception 
and inference. Metalevel analyses have been aimed 
at endowing computational systems with the 
ability to make expected utility decisions about 
the ideal balance between effort or delay and 
the quality of actions taken in the world. The use 
of such rational metareasoning plays a central 
role in decision-theoretic models of bounded ra- 
tionality (10-14). 

Rational metareasoning has been explored in 
multiple problem areas, including guiding computa- 
tion in probabilistic inference and decision-making 
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(i, 13, 14), controlling theorem proving (15), 
handling proactive inference in light of incom- 
ing streams of problems (16), guiding heuristic 
search (13, 17), and optimizing sequences of ac- 
tion (18-20). Beyond real-time metareasoning, 
efforts have explored offline analysis to learn 
and optimize policies for guiding real-time meta- 
reasoning and for enhancing real-time inference 
via such methods as precomputing and caching 
portions of inference problems into fast-response 
reflexes (21). 

The value of metalevel reflection in computa- 
tional rationality is underscored by the complex- 
ity of probabilistic inference in Bayesian networks, 
which has been shown to be in the nondeter- 
ministic polynomial-time (NP)-hard complexity 
class (22). Such worst-case complexity highlights 
the importance of developing approximations 
that exploit the structure of real-world problems. 
A tapestry of approximate inferential methods 
have been developed, including procedures that 
use Monte Carlo simulation, bounding methods, 
and methods that decompose problems into 
simpler sets of subproblems (7). Some methods 
allow a system to trade off computation time 
for accuracy. For example, sampling procedures 
can tighten the bounds on probabilities of inter- 
est with additional computation time. Charac- 
terizations of the tradeoffs can be uncertain in 
themselves. Other approaches to approximation 
consider tradeoffs incurred with modulating the 
complexity of models, such as changing the size 
of models and the level of abstraction of evi- 
dence, actions, and outcomes considered (JI, 21). 

A high-level view of the interplay between the 
value and cost of inference at different levels of 
precision is captured schematically in Fig. 2A. 


Here, the value of computing with additional 
precision on final actions and cost of delay for 
computation are measured in the same units of 
utility. A net value of action is derived as the 
difference between the expected value of action 
based on a current analysis and the cost of com- 
putation required to attain the level of analysis. 
In the situation portrayed, costs increase in a 
linear manner with a delay for additional com- 
putation, while the value of action increases with 
decreasing marginal returns. We see the attain- 
ment of an optimal stopping time, in which at- 
tempts to compute additional precision come at 
a net loss in the value of action. As portrayed in 
the figure, increasing the cost of computation 
would lead to an earlier ideal stopping time. In 
reality, we rarely have such a simple economics 
of the cost and benefits of computation. We are 
often uncertain about the costs and the expected 
value of continuing to compute and so must 
solve a more sophisticated analysis of the ex- 
pected value of computation. A metalevel rea- 
soner considers the current uncertainties, the 
time-critical losses with continuing computation, 
and the expected gains in precision of reasoning 
with additional computation. 

As an example, consider a reasoning system 
that was implemented to study computational 
rationality for making inferences and providing 
recommendations for action in time-critical med- 
ical situations. The system needs to consider the 
losses incurred with increasing amounts of delay 
with action that stems from the time required for 
inference about the best decision to take in a 
setting. The expected value of the best decision 
may diminish as a system deliberates about a 
patient’s symptoms and makes inferences about 


physiology. A trace of a reasoning session guided 
by rational metareasoning of a time-critical 
respiratory situation in emergency medicine is 
shown in Fig. 2B (4). An inference algorithm 
(named Bounded Conditioning) continues to 
tighten the upper and lower bounds on a critical 
variable representing the patient’s physiology, 
using a Bayesian network to analyze evidence. 
The system is uncertain about the patient’s state, 
and each state is associated with a different time 
criticality and ideal action. The system continues 
to deliberate at the metalevel about the value of 
continuing to further tighten the bounds. It mon- 
itors this value via computation of the expected 
value of computation. When the inferred ex- 
pected value of computation goes to zero, the 
metalevel analysis directs the base-level system 
to stop and take the current best inferred base- 
level action possible. 


Computational rationality in mind 
and brain 


In parallel with developments in AI, the study 
of human intelligence has charted a similar 
progression toward computational rationality. 
Beginning in the 1950s, psychologists proposed 
that humans are “intuitive statisticians,” using 
Bayesian decision theory to model intuitive 
choices under uncertainty (23). In the 1970s and 
1980s, this hypothesis met with resistance from 
researchers who uncovered systematic fallacies 
in probabilistic reasoning and decision-making 
(24), leading some to adopt models based on in- 
formal heuristics and biases rather than norma- 
tive principles of probability and utility theory 
(25). The broad success of probabilistic and 
decision-theoretic approaches in AI over the past 
two decades, however, has helped 
to return these ideas to the center 
of cognitive modeling (5, 26-28). 
The development of methods for 
approximate Bayesian updating via 
distributed message passing over 
large networks of variables suggests 
that similar procedures might be 
used for large-scale probabilistic in- 
ference in the brain (29). At the same 
time, researchers studying human 
judgment and decision-making con- 
tinue to uncover ways in which peo- 
ple’s cognitive instincts appear far 
from the MEU ideals that econo- 
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Fig. 2. Economics of thinking in computational rationality. (A) Systems must consider the expected value and 
cost of computation. Flexible computational procedures allow for decisions about ideal procedures and stopping 
times (t*) in order to optimize the net value of action (u*). In the general case, the cost-free value associated with 
obtaining a computed result at increasing degrees of precision and the cost of delay with computation are 
uncertain (indicated by the bell curve representing a probability distribution). Thus, the time at which further 
refinement of the inference should stop and action should be taken in the world are guided by computation of the 
expected value of computation. [Adapted from (16) with permission] (B) Trace of rational metareasoning in a 
time-critical medical setting. [Adapted from (14) with permission] A bounding algorithm continues to tighten the 
upper bound (UB) and lower bound (LB) on an important variable representing a patient’s physiology. When a 
continually computed measure of the value of additional computation (red line) goes to zero, the base-level model 
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40 hoped for. 

Computational rationality offers 
a framework for reconciling these 
contradictory pictures of human in- 
telligence. If the brain is adapted to 
compute rationally with bounded 
resources, then “fallacies” may arise 
as a natural consequence of this 
optimization (30). For example, a 
generic strategy for approximating 
Bayesian inference is by sampling 
hypotheses, with the sample-based 
approximation converging to the true 
posterior as more hypotheses are 
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mental parsing of a garden-path sentence. [Adapted from (36)] (Top) Evo- higher confidence) after some time interval A, for a particle-filter object 


lution of the posterior probability for two different syntactic parses (shown 
in boxes). The initially favored parse is disfavored by the end of the sen- 
tence. (Bottom) A resource-constrained probabilistic parser (based on a 
particle filter with few particles) may eliminate the initially unlikely parse 
and therefore fail to correctly parse the sentence by the end, as shown by 
the proportion of particle filters with 20 particles that successfully parse 
the sentence up to each word. (B) Sample-based inference for multiple- 
object tracking. In this task, subjects are asked to track a subset of dots 
over time, marked initially in red. Lines denote velocity, and circles denote 
uncertainty about spatial transitions. In the second frame, red shading 


tracker. Uncertainty scales with velocity, explaining why high-velocity ob- 
jects are harder to track (32). (©) For a sampling-based approximate Bayesian 
decision-maker, facing a sequence of binary choices, expected utility per 
decision, and number of decisions per unit time can be combined to 
compute the expected utility per unit time as a function of the number of 
posterior samples and action/sample cost ratios. Circles in the rightmost 
graph indicate the optimal number of samples at a particular action/sample 
cost ratio. For many decisions, the optimal choice tradeoff between accuracy 
and computation time suggests deciding after only one sample. [Reprinted 
from (38) with permission] 


sampled (7). Evidence suggests that humans 
use this strategy across several domains, includ- 
ing causal reasoning (37), perception (32, 33), 
and category learning (34). Sampling algorithms 
can also be implemented in biologically plausible 
neural circuits (35), providing a rational expla- 
nation for the intrinsic stochasticity of neurons. 

We see a correspondence between the sam- 
pling algorithms humans appear to use and those 
used in state-of-the-art AI systems. For example, 
particle filters—sequential sampling algorithms 
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for tracking multiple objects moving in a dynamic 
uncertain environment (36)—are at the heart of 
the Google self-driving car’s picture of its sur- 
roundings (Fig. 1B) and also may describe how 
humans track multiple objects (Fig. 3B) (32). 
When only a small number of hypotheses are 
sampled, various biases emerge that are con- 
sistent with human behavior. For instance, “gar- 
den path” effects in sentence processing, in 
which humans perseverate on initially promising 
hypotheses that are disconfirmed by subse- 


quent data, can be explained by particle filters 
for approximate online parsing in probabilistic 
grammars (Fig. 3A) (37). These biases may in 
fact be rational under the assumption that sam- 
pling is costly and most gains or losses are small, 
as in many everyday tasks; then, utility can be 
maximized by sampling as few as one or a few 
high-posterior probability hypotheses for each 
decision (Fig. 3C) (38). 

This argument rests crucially on the assertion 
that the brain is equipped with metareasoning 
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mechanisms sensitive to the costs of cognition. 
Some such mechanisms may take the form of 
heuristic policies hardwired by evolutionary 
mechanisms; we call these “heuristic” because 
they would be metarational only for the range 
of situations that evolution has anticipated. There 
is also evidence that humans have more adaptive 
metareasoning mechanisms sensitive to the costs 
of cognition in online computation. In recent 
work with the “demand selection” task (39-41), 
participants are allowed to choose between two 
cognitive tasks that differ in cognitive demand 
and potential gains. Behavioral findings show 
that humans trade off reward and cognitive ef- 
fort rationally according to a joint utility function 
(40). Brain imaging of the demand selection task 
has shown that activity in the lateral prefrontal 
cortex, a region implicated in the regulation of 
cognitive control, correlates with subjective re- 
ports of cognitive effort and individual differences 
in effort avoidance (41). 

Several recent studies have provided support 
for rational metareasoning in human cognition 
when computational cost and reward tradeoffs 
are less obvious (42, 43). As an example, humans 
have been found to consistently choose list- 
sorting strategies that rationally trade time and 
accuracy for a particular list type (42). This study 
joins earlier work that has demonstrated adap- 
tive strategy selection in humans (44, 45) but 
goes beyond them by explicitly modeling strategy 
selection using a measure of the value of com- 
putation. In another study (46), humans were 
found to differentially overestimate the frequency 
of highly stressful life events (such as lethal acci- 
dents and suicide). This “fallacy” can be viewed 
as rational under the assumption that only a small 
number of hypotheses can be sampled: Expected 
utility is maximized by a policy of utility-weighted 
sampling. 


Computational tradeoffs in 
sequential decision-making 


Computational rationality has played an impor- 
tant role in linking models of biological intelli- 
gence at the cognitive and neural levels in ways 
that can be seen most clearly in studies of se- 
quential decision-making. Humans and other 
animals appear to make use of different kinds 
of systems for sequential decision-making: “model- 
based” systems that use a rich model of the 
environment to form plans, and a less complex 
“model-free” system that uses cached values to 
make decisions (47). Although both converge 
to the same behavior with enough experience, 
the two kinds of systems exhibit different trade- 
offs in computational complexity and flexibility. 
Whereas model-based systems tend to be more 
flexible than the lighter-weight model-free sys- 
tems (because they can quickly adapt to changes 
in environment structure), they rely on more ex- 
pensive analyses (for example, tree-search or dy- 
namic programming algorithms for computing 
values). In contrast, the model-free systems use 
inexpensive, but less flexible, look-up tables or 
function approximators. These efforts have con- 
ceptual links to efforts in AI that have sought to 
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reduce effort and to speed up responses in real 
time by optimizing caches of inferences via off- 
line precomputation (27). 

Studies provide evidence that model-based and 
model-free systems are used in animal cognition 
and that they are supported by distinct regions 
of the prefrontal cortex (48) and striatum (49). 
Evidence further suggests that the brain achieves 
a balance between computational tradeoffs by 
using an adaptive arbitration between the two 
kinds of systems (50, 57). One way to implement 
such an arbitration mechanism is to view the 
invocation of the model-based system as a meta- 
action whose value is estimated by the model- 
free system (57). 

Early during learning to solve a task, when the 
model-free value estimates are relatively inaccu- 
rate, the benefits of using the model-based sys- 


Model-free: caching 


States 


0.3} 0.4] 0.3] 0.5| 0.1 


» Expected value — time/effort cost 


tem outweigh its cognitive costs. Thus, moderately 
trained animals will be sensitive to changes in 
the causal structure of the environment (for 
example, the devaluation of a food reinforcer by 
pairing it with illness). After extensive training, 
the model-free values are sufficiently accurate 
to attain a superior cost-benefit tradeoff (46). 
This increasing reliance on the model-free system 
manifests behaviorally in the form of “habits’”— 
computationally cheap but inflexible policies. 
For example, extensively trained animals will 
continue pursuing a policy that leads to previously 
devalued reinforcers (52). 

The arbitration mechanism described above 
appears to adhere to the principles of compu- 
tational rationality: The model-based system is 
invoked when deemed computationally advan- 
tageous through metareasoning (Fig. 4A). For 
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Fig. 4. Computational tradeoffs in use of different decision-making systems. (A) A fast but inflexible 
model-free system stores cached values in a look-up table but can also learn to invoke a slower but more 
flexible model-based system that uses forward search to construct an optimal plan. The cached value for 
invoking the model-based system (highlighted in green) is a simple form of metareasoning that weighs the 
expected value of forward search against time and effort costs. (B) Hippocampal place cell—firing patterns 
show the brain engaged in forward search at a choice point, sweeping ahead of the animal's current 
location. Each image shows the time-indexed representation intensity of locations in pseudocolor (red, 
high probability; blue, low probability). The representation intensity is computed by decoding spatial 
location from ensemble recordings in hippocampal area CA3. [Reprinted with permission from (56)] 
(C) Similar principles apply to more complex decision problems, such as Atari and Go (left). Al systems 
(right) use complex value function approximation architectures, such as deep convolutional nets (top 


right) [reprinted with permission from (60)], for model-free control, 


and sophisticated forward search 


strategies, such as Monte Carlo Tree Search (bottom right) [reprinted with permission from (61)], for 


model-based control. 
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example, reliance on the model-based system 
decreases when the availability of cognitive 
resources are transiently disrupted (53). Recent 
data show that the arbitration mechanism may 
be supported by the lateral prefrontal cortex (54), 
the same region involved in the registration of 
cognitive demand. 

Finer-grained metareasoning may play a role 
within the richer model-based systems them- 
selves. One way to approximate values is to adapt 
the sampling hypothesis to the sequential deci- 
sion setting, stochastically exploring trajectories 
through the state space and using these sample 
paths to construct a Monte Carlo estimator. Re- 
cently, a class of sampling algorithms known as 
Monte Carlo Tree Search (MCTS) has gained con- 
siderable traction on complex problems by bal- 
ancing exploration and exploitation to determine 
which trajectories to sample. MCTS has achieved 
state-of-the-art performance in computer Go as 
well as a number of other difficult sequential 
decision problems (55). A recent study analyzed 
MCTS within a computational rationality frame- 
work and showed how simulation decisions 
can be chosen to optimize the value of com- 
putation (20). 

There is evidence that the brain might use 
an algorithm resembling MCTS to solve spatial 
navigation problems. In the hippocampus, “place 
cells” respond selectively when an animal is in 
a particular spatial location and are activated 
sequentially when an animal considers two dif- 
ferent trajectories (Fig. 4B) (56). Pfeiffer and Foster 
(57) have shown that these sequences predict an 
animal’s immediate behavior, even for new start 
and goal locations. It is unknown whether for- 
ward sampling observed in place cells balances 
exploration and exploitation as in MCTS, explor- 
ing spatial environments the way MCTS explores 
game trees, or whether they are sensitive to the 
value of computation. These are important stand- 
ing questions in the computational neuroscience 
of decision-making. 

At the same time, AI researchers are beginning 
to explore powerful interactions between model- 
based and model-free decision-making systems 
parallel to the hybrid approaches that computa- 
tional cognitive neuroscientists have investigated 
(Fig. 4C). Model-free methods for game-playing 
based on deep neural networks can, with exten- 
sive training, match or exceed model-based MCTS 
approaches in the regimes that they have been 
trained on (58). Yet, combinations of MCTS and 
deep-network approaches beat either approach 
on its own (59) and may be a promising route to 
explain how human decision-making in complex 
sequential tasks can be so accurate and so fast 
yet still flexible to replan when circumstances 
change—the essence of acting intelligently in an 
uncertain world. 


Looking forward 


Computational rationality offers a potential uni- 
fying framework for the study of intelligence in 
minds, brains, and machines, based on three 
core ideas: that intelligent agents fundamen- 
tally seek to form beliefs and plan actions in 
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support of maximizing expected utility; that 
ideal MEU calculations may be intractable for 
real-world problems, but can be effectively ap- 
proximated by rational algorithms that maximize 
amore general expected utility incorporating the 
costs of computation; and that these algorithms 
can be rationally adapted to the organism’s spe- 
cific needs, either offline through engineering 
or evolutionary design, or online through meta- 
reasoning mechanisms for selecting the best 
approximation strategy in a given situation. 
We discussed case studies in which these ideas 
are being fruitfully applied across the disci- 
plines of intelligence, but we admit that a ge- 
nuine unifying theory remains mostly a promise 
for the future. We see great value in pursuing 
new studies that seek additional confirmation 
(or disconfirmation) of the roles of machinery for 
cost-sensitive computation in human cogni- 
tion, and for enabling advances in AI. Although 
we cannot foresee precisely where this road 
leads, our best guess is that the pursuit itself is 
a good bet—and as far as we can see, the best 
bet that we have. 
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Keeping cool 


Edited by Stella Hurtley 


Saharan silver ant 


ilver ants inhabit one of the hottest and driest environments on Earth, the Saharan sands, 

where most insects shrivel and die moments after contact. Shi et a/. show that the triangular 

shape of the silver hairs that cover their bodies enables this existence. The hairs both increase 

the reflection of near-infrared rays and dissipate heat from the ants’ bodies, even under full 

sun conditions. Evolution’s simple solution to intense heat management in this species could 
lead to better designs for passive cooling of human-produced objects. — SNV 


To defeat SIV, add a 


protein boost 

Despite 30 years of effort, no 
HIV-1 vaccine exists. Barouch 

et al. evaluated one promising 
strategy in rhesus macaques, 

a preclinical model commonly 
used to test potential HIV-1 
vaccine candidates. They 
immunized monkeys with adeno- 
virus-36 vectors engineered 


SCIENCE sciencemag.org 


to express SIV (simian immu- 
nodeficiency virus) genes and 
then boosted them with a 
recombinant gp120 envelope 
glycoprotein (Env) from SIV. This 
regimen afforded greater protec- 
tion than a strategy that instead 
used a viral vector-based boost. 
A parallel trial using a SHIV (sim- 
ian/human immunodeficiency 
virus)—based vaccine and chal- 
lenge model produced similar 
results. Whether this particular 


Science, this issue p. 298 


approach will be equally suc- 
cessful in humans remains to be 
tested. — KLM 


Science, this issue p. 320 


Skyrmions emerge 
in trilayers 


Skyrmions are tiny whirlpools of 
magnetic spin with potential to 
act as carriers of information in 


future devices. Skyrmions have 
been observed in multiple mate- 
rials but usually at impractically 
low temperatures. Jiang et al. 
used a constriction in a trilayer 
system to create skyrmions 

at room temperature (see the 
Perspective by von Bergmann). 
The authors pushed elongated 
magnetic domains through the 
constriction using an in-plane 
current, causing individual skyr- 
mion bubbles to form. — JS 


Science, this issue p. 283; 
see also p. 234 


Looking at teeny tiny 
platinum particles 


Electron microscopy is a 
powerful technique for taking 
snapshots of particles or images 
at near-atomic resolution. Park et 
al. studied free-floating platinum 
nanoparticles using electron 
microscopy and liquid cells (see 
the Perspective by Colliex). Using 
analytical techniques developed 
to study biological molecules, 
they reconstructed the three- 
dimensional features of the Pt 
particles at near-atomic resolu- 
tion. This approach has the scope 
to study a mixed population of 
particles one at a time and to 
study their synthesis as it occurs 
in solution. — MSL 


Science, this issue p. 290; 
see also p. 232 


Substrate channeling in 
morphine biosynthesis 


Poppies are still the most 
economically viable source of 
the excellent painkiller mor- 
phine. Winzer et a/. have now 
identified a key enzyme in the 
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poppy’s biosynthetic pathway 
for morphine. The enzyme turns 
out to be an unusual protein that 
contains both cytochrome P-450 
and oxidoreductase modules. 
Together these modules process 
two subsequent steps in the 
biosynthetic pathway. The 
identification of this enzyme 
may enable alternate routes 
for morphine biosynthesis that 
are less dependent on poppy 
cultivation. — PJH 

Science, this issue p. 309 


How germ cells 
become sperm or egg 


During vertebrate development, 
germ cells switch from a sexually 
indifferent to a committed state 
for either egg or sperm. Signals 
from somatic gonadal cells are 
generally thought to influence 
the sexual differentiation of 
germ cells. However, Nishimura 
et al. demonstrate that germ 
cell-intrinsic sex determination 
cues are at play in the teleost 
fish medaka. The forkhead box 
transcriptional factor foxl3 
represses the initiation of sper- 
matogenesis. In the absence of 
foxl3 function, females develop 
ovaries filled with functional 
sperm. Thus, the male gonad 
environment is not required for 
spermatogenesis.—BAP 
Science, this issue p. 328 


Movers and shakers 
When the edge of an ice sheet 
breaks off and falls into the sea 
(calves), the remaining section 


Icebergs near 
the terminus of 
Helheim Glacier 


280 


of the ice sheet moves backward 
and down and can suffer a glacial 
earthquake. Murray et al. studied 
calving from Greenland’s Helheim 
Glacier. The forces that cause the 
change in the motion of the ice 
sheet at its terminus also trigger 
the accompanying earthquakes. 
Because these seismic signals 
can be detected by instruments 
located all over the globe, it 
should be possible to use these 
glacial earthquakes as proxies for 
glacier calving. — HJS 

Science, this issue p. 305 


Long-acting drug to 
treat resistant malaria 


Malaria kills 0.6 million people 
annually. Currently available 
drugs are no longer fully effec- 
tive, because the malarial 
parasite has developed resis- 
tance. Now, Phillips et al. have 
identified a drug, DSM265, that 
kills both drug-sensitive and 
drug-resistant parasites by tar- 
geting their ability to synthesize 
precursors required for synthe- 
sis of DNA and RNA. DSM265 
kills parasites in the blood and 
the liver and is sufficiently long- 
acting that it could potentially 
cure malaria after a single dose 
or provide effective chemopre- 
vention if given weekly. — OMS 
Sci. Transl. Med.7, 296ral1l11 (2015). 


GBS toxin activates mast 
cells for host defense 


Ascending Group B streptococ- 
cus (GBS) is a major cause of 
preterm birth. How the mother 
defends against GBS infec- 
tions is not clear. Working in 
mice, Rajagopal et al. found that 
immune cells, called mast cells, 
are activated by a lipid toxin 
produced by GBS. This lipid toxin 
is an ornithine rhamno-polyene 
that stains the bacteria red. The 
toxin assists penetration of the 
placenta by the bacteria. The 
toxin also stimulates mast cell 
degranulation, an early step in 
host efforts to defeat the bacte- 
rial infection. — PLY 


Sci. Adv. 10.1126/ 
sciadv.1400225 (2015). 
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GENOME DEFENSE 


Hunter RNAs seek 


Edited by Sacha Vignieri 
and Jesse Smith 


Tetrahymena uses 
small RNAs to fend 
off transposable 
elements 


and destroy parasitic DNAs 


ransposable elements (TEs) are parasitic DNA sequences. 

They present a serious threat to the host genomes in 

which they reside. Like other eukaryotes, the single-celled 

protist Tetrahymena uses small RNAs to control parasitic 

TEs. Noto et al. show that a subset of these small RNAs 
recognize the TEs they are synthesized from as foreign, as well 
as sequence-related TEs in other places in the genome. These 
RNAs stimulate the production of a second wave of small RNAs 
at these distant sites. The second-wave, or “late,” RNAs can bind 
to still further TE sequences, ensuring a robust and heritable 
defense of the entire genome. — GR 


Mol. Cell 59, 10.1016/j.molcel.2015.05.024 (2015). 


X-ray echoes used 
as a cosmic yardstick 


Determining accurate distances 
to astronomical x-ray sources 

is notoriously difficult, but 
Heinz et al. have developed a 
new technique for doing so. 


Published by AAAS 


They exploited “light echoes” 
from the object Cir X-1, in which 
x-rays emitted during a large 
flare bounced off foreground 
dust and appeared months later 
as delayed rings of emission 
around the source. By compar- 
ing the x-ray rings to radio data, 
they were able to identify the 
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foreground clouds of dust and 
thereby measure the distance to 
Cir X-1 with an accuracy of 10%. 
Previous estimates had varied 
by almost a factor of 3. — KTS 
Astrophys. J. 806, 265 (2015). 


A gap in a topological 
surface state 


Topological insulators (Tls) 
have, in theory, conducting 
surfaces and insulating bulks. 
In practice, Tls often have a 
considerable bulk conductance 
that masks the conduction from 
the topologically protected 
surface states. To make the 
surface state nonconducting—in 
other words, to open a gap in its 
dispersion—researchers usually 
resort to magnetic doping. Now, 
Weber et al. demonstrate that a 
topological surface state of the 
compound Bi,Se, is naturally, 
although partially, gapped. To 
show this, the authors use a 
combination of photoemission 
measurements and calculations. 
The results may be more general 
within the subclass of Tls to 
which Bi,Se, belongs. — JS 
Phys. Rev. Lett. 10.1103/ 
PhysRevLett.114.256401 (2015). 


Trapped on the 
wrong pathway 


Protein folding can be described 
as diffusion over an energy 
landscape in conformational 
space to reach an energy 
minimum that represents a 
stable folded structure, and it is 
implicated in many diseases. A 
particularly dramatic example is 
the prion protein, PrP, that folds 
rapidly into a native monomeric 
structure but also has a stable 
oligomeric form that causes 
prion disease. Yu et al. used 
single-molecule force spectros- 
copy to monitor misfolding of a 
PrP dimer. The dimer misfolds 
along a single pathway involv- 
ing several intermediates, one 
of which blocks native folding. 
Diffusion across the energy 
landscape is 1000 times slower 
than for the folding of native 
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prion monomers, probably 
indicating that many unproduc- 
tive interactions occur during 
misfolding. — VV 
Proc. Natl. Acad. Sci. U.S.A. 10.1073/ 
pnas.1419197112 (2015). 


Seeing the forest 
from the trees 


Boreal forests contain nearly 
half of the carbon stored by the 
trees of the world,and thus are a 
critical component of the global 
carbon cycle. How much carbon 
is contained in this part of the 
ecosystem still is inadequately 
known, though. Chen and Luo 
studied the net annual above- 
ground biomass change of four 


major boreal forest types in 
western Canada over the period 
between 1958 and 2011 and 
found that it declined for all of 
those groups. They attribute the 
loss to increased tree mortality 
and reduced growth, caused 
mostly by persistent warming 
and decreasing water availabil- 
ity, trends that are expected to 
escalate in the future. — HJS 


Global Change Biol. 10.1111/ 
gcb.12994 (2015). 


Contextual memory 
networks in monkey brains 


Memory tasks activate a number 
of areas in the prefrontal cortex 
of the brain. Lesions in this region 


The genetic underpinnings of brain size 


hat genes and selective processes lead to an increase in brain size and intelligence is one of 

the greatest questions in human evolution. Chen et al. present findings in fish that may help 

us understand this process. When guppies were selected for brain size, expression differ- 

ences were observed in the Angiopoietin-1 (Ang-1) gene. These differences seem to be due 

to noncoding variation; furthermore, lowering Ang-1 expression in the zebrafish also affected 
brain size. On the basis of these results, the authors suggest that further study of Ang-1 may help pro- 
vide insight into the evolution of brain size and cognition in other species, including humans. — LMZ 


cause memory deficits; however, 
lesions in only a fraction of the 
activated areas actually lead 

to severe memory loss. Osada 

et al. measured whole-brain 
activity during a memory task 

in monkeys. The activated areas 
and their task-specific functional 
connectivity formed a hierarchical 
network centered on a hub. This 
functional hub largely matched 
the documented sites where 
lesions had the most dramatic 
effect. Neighboring sites where 
lesions had no impact were much 
less affected. The functional con- 
nections of an area predicted the 
degree of memory loss much bet- 
ter than its role in the anatomical 
network. — PRS 


PLOS Biol. 10.1371/journal. 
pbio.1002177 (2015). 


> 


Angiopoietin-1 expression varies 
s with brain size in guppies 


Proc. R. Soc. London Ser. B10.1098/rspb.2015.0872 (2015). 


Published by AAAS 
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CHROMOSOMES 
Protein partners for 
chromosome silencing 


Female mammals have two X 
chromosomes, one of which is 
almost completely shut down 
during development. The long 
noncoding Xist RNA plays a role 
in this process. To understand 
how a whole chromosome can 
be stably inactivated, Minajigi 
et al. identified many of the 
proteins that bind to the Xist 
RNA, which include cohesins. 
Paradoxically, the interac- 
tion between Xist and cohesin 
subunits resulted in repulsion 
of cohesin complexes from the 
inactive X chromosome, chang- 
ing the three-dimensional shape 
of the whole chromosome. — GR 
Science, this issue p. 282 


PLANT ECOLOGY 
Grassland diversity and 
ecosystem productivity 


The relationship between plant 
species diversity and ecosystem 
productivity is controversial. 

The debate concerns whether 
diversity peaks at intermediate 
levels of productivity—the so- 
called humped-back model—or 
whether there is no clear predict- 
able relationship. Fraser et al. 
used a large, standardized, and 
geographically diverse sample of 
grasslands from six continents to 
confirm the validity and general- 
ity of the humped-back model. 
Their findings pave the way for a 
more mechanistic understanding 
of the factors controlling species 
diversity. — AMS 


Science, this issue p. 302 


HEAVY FERMIONS 
Probing the insulating 
state of SmB, 


When a metal is subjected to 
a strong magnetic field, its 
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electrons start rearranging 
into new energy levels, caus- 
ing its electronic properties to 
oscillate as a function of the 
field. Unexpectedly, Tan et al. 
observed this phenomenon, 
called quantum oscillations, in 
the Kondo insulator samarium 
hexaboride (SmB,), which does 
not conduct electricity. They 
measured the magnetic torque 
and detected quantum oscilla- 
tions originating from the bulk of 
this heavy fermion compound. 
These oscillations had an 
unusual temperature depen- 
dence, which presents another 
puzzle to theorists seeking to 
understand the nature of the 
insulating state of SmB,. — JS 
Science, this issue p. 287 


ANIMAL PHYSIOLOGY 
Not that unusual after all 


As polar ice recedes, polar bears 
are facing a changed habitat 
with reduced summer forag- 
ing opportunities. It has been 
hypothesized that they might 
be able to resist summer food 
shortages by reducing their met- 
abolic needs in a sort of “walking 
hibernation.” Whiteman et al. 
monitored energy expenditure in 
polar bears both on and off the 
ice and found energy reductions, 
but that these were more akin 
to normal mammalian fasting 
levels. Thus, it appears that polar 
bears have no energetic protec- 
tions against reduced summer 
food supplies and will face 
increasing starvation threats if 
summer foraging habitats con- 
tinue to decline. — SNV 

Science, this issue p. 295 


CIRCADIAN RHYTHMS 
Biochemical basis of a 
24-hour clock 


Circadian clocks keep organisms 
in synch with such daily cycles 
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as illumination, activity, and food 
availability. The circadian clock in 
cyanobacteria has the necessary 
24-hour period despite its three 
component proteins having bio- 
chemical activities that occur on 
amuch faster time scale. Abe et 
al. focused on the cyanobacterial 
clock component KaiC, an ade- 
nosine triphosphatase (ATPase) 
that can autophosphorylate and 
autodephosphorylate. The slow 
ATPase activity of KaiC, which is 
linked to a peptide isomerisation, 
provided the slow kinetics that set 
the speed of the 24-hour clock. 
Chang et al. found that another 
clock component, KaiB, also 
has slow changes in its protein 
conformation that help to set the 
oscillation period of the clock and 
its signaling output. — LBR 
Science, this issue pp. 312 and 324 


ECOLOGY 
The benefits of diversity 
Pathogens and parasites are 
an integral part of all ecosys- 
tems. But the likelihood of 
them causing disease depends 
on environmental factors. Ina 
Perspective, Keesing and Ostfeld 
highlight recent studies on 
disease prevalence as a function 
of diversity. In humans, other 
animals, and plants, disease 
prevalence is lower when diver- 
sity is high. This is the case even 
though pathogen diversity is 
higher in more-diverse systems. 
As biodiversity is lost from 
ecosystems, they become more 
vulnerable to infections. — JFU 
Science, this issue p. 235 


INFLAMMATION 


Neutrophil NETs drive 
atherosclerosis 


The buildup of fats, cholesterol, 
and other substances in arteries 
causes atherosclerosis, which 
restricts blood flow and can 


Published by AAAS 


lead to heart attacks and stroke. 
Inflammation contributes to 

the pathogenesis of athero- 
sclerosis, but exactly how is not 
fully understood. Warnatsch 

et al. now show that immune 
cells called neutrophils release 
NETs (neutrophil extracellular 
traps) (see the Perspective by 
Nahrendorf and Swirski). These 
NETs are composed of DNA and 
antimicrobial proteins, and in the 
setting of atherosclerosis they 
activate innate immune signal- 
ing pathways in macrophages. 
This causes the macrophages to 
secrete proinflammatory cyto- 
kines, exacerbating the disease. 
Indirectly, NETS also attract a 
specialized subset of T cells that 
further amplify the proinflam- 
matory response. -KLM 


Science, this issue p. 316; 
see also p. 237 


IMMUNOLOGY 
Fine-tuning the 


inflammatory response 
The binding of the bacterial 
product lipopolysaccharide 
(LPS) to the receptor TLR4 on 
macrophages triggers inflam- 
matory responses that require 
the transcription factor NF-kB. 
TLR4 recruits the adaptor protein 
MyD88 when the receptor is at 
the plasma membrane and a dif- 
ferent adaptor protein, TRIF, after 
internalization into endosomes. 
Cheng et al. found that MyD88 
was required for the initial peak 
of transient NF-kB activation in 
all LPS-stimulated cells (see also 
the Focus by Williams et al.). In 
contrast, TRIF was required for 
more sustained NF-kB activation 
in a subset of cells. Thus, macro- 
phages use both adaptor proteins 
to fine-tune NF-kB activation to 
induce an appropriate inflamma- 
tory response. — JFF 

Sci. Signal. 8, ra69 and fs13 (2014). 
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CHROMOSOMES 


A comprehensive Xist interactome 
reveals cohesin repulsion and an RNA- 
directed chromosome conformation 


Anand Minajigi,* John E. Froberg,* Chunyao Wei, Hongjae Sunwoo, Barry Kesner, 
David Colognori, Derek Lessing, Bernhard Payer, Myriam Boukhali, 


Wilhelm Haas, Jeannie T. Leet 


INTRODUCTION: The mammal has 
evolved an epigenetic mechanism to 
silence one of two X chromosomes in the 
XX female to equalize gene dosages with 
the XY male. Once established, the inacti- 
vated X chromosome (Xi) is extremely 
stable and is maintained through the 
lifetime of the female mammal. The prin- 
cipal regulator, Xist, is a long noncoding 
RNA that orchestrates the silencing proc- 
ess along the Xi. Xist is believed to 
operate as a scaffold to recruit and spread 
repressive complexes, such as Polycomb 
Repressive Complex 2, along the X chro- 
mosome. The identities of crucial in- 
teracting factors, however, have remained 
largely unknown. 
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RATIONALE: Although the Xi’s epigenetic 
stability is a necessary homeostatic prop- 
erty, an ability to unlock this epigenetic 
state is of great current interest. The X 
chromosome is home to nearly 1000 genes, 
at least 50 of which have been implicated in 
X-linked diseases, such as Rett syndrome 
and fragile X syndrome. The Xi is therefore 
a reservoir of functional genes that could be 
tapped to replace expression of a disease allele 
on the active X (Xa). A major gap in current 
understanding is the lack of a comprehensive 
Xist interactome. Progress toward a full 
interactome would advance knowledge of 
epigenetic regulation by long noncoding 
RNA and potentially inform treatment of X- 
linked diseases. 
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EXPULSION 


An operational model for how Xist RNA orchestrates the Xi state. Xist is a multitasking RNA 
that brings many layers of repression to the Xi. Although Xist RNA recruits repressive complexes 
(such as PRC1, PRC2, DNMT1, macroH2A, and SmcHD1) to establish and maintain the inactive 
state, it also actively repels activating factors and architectural proteins (Such as the cohesins and 
CTCF) to avoid acquisition of a transcription-favorable chromatin conformation. 
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RESULTS: We have developed an RNA-centric 
proteomic method called iDRiP (identification 
of direct RNA-interacting proteins). 

Using iDRiP, we identified 80 to 200 pro- 
teins in the Xist interactome. The interactors 
fall into several functional categories, includ- 
ing cohesins, condensins, topoisomerases, RNA 
helicases, chromatin remodelers, histone modi- 

fiers, DNA methyltransfer- 
ases, nucleoskeletal factors, 
Read the full article and nuclear matrix pro- 
at http://dx.doi. teins. Targeted inhibition 
org/10.1126/ demonstrates that Xi si- 
science.aab2276 lencing can be destabil- 
idetenaeomia craniutcatial ed by dienapting muitiple 
components of the interactome, consistent 
with the idea that these factors synergistical- 
ly repress Xi transcription. Triple-drug treat- 
ments lead to a net increase of Xi expression 
and up-regulation of ~100 to 200 Xi genes. We 
then carry out a focused study of X-linked cohe- 
sin sites. Chromatin immunoprecipitation se- 
quencing analysis demonstrates three types of 
cohesin sites on the X chromosome: Xi-specific 
sites, Xa-specific sites, and biallelic sites. We 
find that the Xa-specific binding sites represent 
a default state. Ablating Xist results in restora- 
tion of Xa-specific sites on the Xi. These find- 
ings demonstrate that, while Xist attracts 
repressive complexes to the Xi, it actively repels 
chromosomal architectural factors such as the 
cohesins from the Xi. Finally, we examine 
how Xist and the repulsion of cohesins affect 
Xi chromosome structure. In wild-type cells, 
the Xa is characterized by ~112 topologically 
associated domains (TADs) and the Xi by 
two megadomains. Intriguingly, loss of Xist 
and restoration of cohesin binding result in 
a reversion of the Xi to an Xa-like chro- 
mosome conformation. Hi-C analysis shows 
that TADs return to the Xi in a manner cor- 
related with the reappearance of cohesins and 
with a transcriptionally permissive state. 


CONCLUSION: Our study unveils many layers 
of Xi repression and demonstrates a central 
role for RNA in the topological organization 
of mammalian chromosomes. Our study also 
supports a model in which Xist RNA simulta- 
neously acts as (i) a scaffold for the recruitment 
of repressive complexes to establish and main- 
tain the inactive state and (ii) a repulsion 
mechanism to extrude architectural factors 
such as cohesins to avoid acquisition of a 
transcription-favorable chromatin conforma- 
tion. Finally, our findings indicate that the 
stability of the Xi can be perturbed by targeted 
inhibition of multiple components of the Xist 
interactome. & 


The list of author affiliations is available in the full article online. 
*These authors contributed equally to this work. 
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CHROMOSOMES 


A comprehensive Xist interactome 


reveals cohesin repulsion and an 
RNA-directed chromosome conformation 


Anand Minajigi,'* John E. Froberg,'* Chunyao Wei,’ Hongjae Sunwoo,' Barry Kesner,* 
David Colognori,’ Derek Lessing,’ Bernhard Payer,’ Myriam Boukhali,” 


Wilhelm Haas,” Jeannie T. Lee't 


The inactive X chromosome (Xi) serves as a model to understand gene silencing on a 
global scale. Here, we perform “identification of direct RNA interacting proteins” (iDRiP) 
to isolate a comprehensive protein interactome for Xist, an RNA required for Xi 

silencing. We discover multiple classes of interactors—including cohesins, condensins, 
topoisomerases, RNA helicases, chromatin remodelers, and modifiers—that synergistically 
repress Xi transcription. Inhibiting two or three interactors destabilizes silencing. 
Although Xist attracts some interactors, it repels architectural factors. Xist evicts cohesins 
from the Xi and directs an Xi-specific chromosome conformation. Upon deleting Xist, 

the Xi acquires the cohesin-binding and chromosomal architecture of the active X. 

Our study unveils many layers of Xi repression and demonstrates a central role for RNA 
in the topological organization of mammalian chromosomes. 


he mammalian X chromosome is unique in 
its ability to undergo whole-chromosome 
silencing. In the early female embryo, X- 
chromosome inactivation (XCI) enables 
mammals to achieve gene dosage equiv- 
alence between the XX female and the XY male 
(1-3). XCI depends on Xist RNA, a 17-kb long 
noncoding RNA (IncRNA) expressed only from 
the inactive X chromosome (Xi) (4) and that im- 
plements silencing by recruiting repressive com- 
plexes (5-8). Whereas XCI initiates only once during 
development, the female mammal stably main- 
tains the Xi through her lifetime. In mice, a germ- 
line deletion of Xist results in peri-implantation 
lethality due to a failure of Xi establishment (9), 
whereas a lineage-specific deletion of Xist causes 
a lethal blood cancer due to a failure of Xi mainte- 
nance (JO). Thus, both the de novo establishment 
and proper maintenance of the Xi are crucial for 
viability and homeostasis. There are therefore 
two critical phases of XCI: (i) A one-time initia- 
tion phase in peri-implantation embryonic devel- 
opment that is recapitulated by differentiating 
embryonic stem (ES) cells in culture, and (ii) a 
lifelong maintenance phase that persists in all 
somatic lineages. 
Once established, the Xi is extremely stable 
and difficult to disrupt genetically and pharma- 
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cologically (11-13). In mice, X reactivation is pro- 
grammed to occur only twice: once in the blastocyst 
to erase the imprinted XCI pattern and a second 
time in the germ line before meiosis (14, 15). Al- 
though the Xi’s epigenetic stability is a homeo- 
static asset, an ability to unlock this epigenetic 
state is of great current interest. The X chromo- 
some is home to nearly 1000 genes, at least 50 of 
which have been implicated in X-linked diseases, 
such as Rett syndrome and fragile X syndrome. 
The Xi is therefore a reservoir of functional genes 
that could be tapped to replace expression of a 
disease allele on the active X (Xa). A better un- 
derstanding of Xi repression would inform both 
basic biological mechanisms and treatment of 
X-linked diseases. 

It is believed that Xist RNA silences the Xi 
through conjugate protein partners. A major gap 
in current understanding is the lack of a compre- 
hensive Xist interactome. Despite multiple at- 
tempts to define the complete interactome, only 
four directly interacting partners have been iden- 
tified over the past two decades, including PRC2, 
ATRX, YY1, and HNRPU: Polycomb repressive 
complex 2 (PRC2) is targeted by Xist RNA to the 
Xi; the ATRX RNA helicase is required for the 
specific association between Xist and PRC2 
(16, 17); YY1 tethers the Xist-PRC2 complex to the 
Xi nucleation center (78); and the nuclear matrix 
factor, HNRPU/SAF-A, enables stable association 
of Xist with the chromosomal territory (19). Many 
additional interacting partners are expected, given 
the large size of Xist RNA and its numerous 
conserved modular domains. We have developed 
an RNA-based proteomic method and implement 
an unbiased screen for Xist’s comprehensive in- 
teractome. We identify a large number of high- 


confidence candidates, demonstrate that it is 
possible to destabilize Xi repression by inhibiting 
multiple interacting components, and then delve 
into a focused set of interactors with the 
cohesins. 


Results 
iDRiP identifies multiple classes of 
Xist-interacting proteins 


A systematic identification of interacting factors 
has been challenging because of Xist’s large size, 
the expected complexity of the interactome, and 
the persistent problem of high background with 
existing biochemical approaches (20). A high 
background could be particularly problematic 
for chemical cross-linkers that create extensive 
covalent networks of proteins, which could in 
turn mask specific and direct interactions. We 
therefore developed iDRiP (identification of direct 
RNA interacting proteins) using the zero-length 
cross-linker, ultraviolet (UV) light, to implement 
an unbiased screen of directly interacting pro- 
teins in female mouse fibroblasts expressing phys- 
iological levels of Xist RNA (Fig. 1A). We performed 
in vivo UV cross-linking, prepared nuclei, and 
solubilized chromatin by DNase I digestion. Xist- 
specific complexes were captured using nine com- 
plementary oligonucleotide probes spaced across 
the 17-kb RNA, with a 25-nucleotide probe length 
designed to maximize RNA capture while reduc- 
ing nonspecific hybridization. The complexes were 
washed under denaturing conditions to elimi- 
nate factors not covalently linked by UV to Xist 
RNA. To minimize background due to DNA-bound 
proteins, a key step was inclusion of DNase I 
treatment before elution of complexes (see the 
supplementary text). We observed substantial 
enrichment of Xist RNA over highly abundant 
cytoplasmic and nuclear RNAs (U6, Jpx, and 18S 
ribosomal RNA) in eluates of female fibroblasts 
(Fig. 1B). Enrichment was not observed in male 
eluates or with luciferase capture probes. Eluted 
proteins were subjected to quantitative mass spec- 
trometry (MS), with spectral counting (27) and 
multiplexed quantitative proteomics (22) yield- 
ing similar enrichment sets (table S1). 

From three independent replicates, iDRiP-MS 
revealed a large Xist protein interactome (Fig. 1C 
and table S1). Recovery of known Xist interac- 
tors, PRC2, ATRX, and HNRPU, provided a first 
validation of the iDRiP technique. Also recovered 
were macrohistone H2A (mH2A), RINGI1 (PRC1), 
and the condensin component, SncHD1—proteins 
known to be enriched on the Xi (19, 23, 24) but 
not previously shown to interact directly with Xist. 
More than 80 proteins were found to be =threefold 
enriched over background; >200 proteins were 
>twofold enriched (table S1). In many cases, mul- 
tiple subunits of the epigenetic complex were 
identified, boosting our confidence in them as 
interactors. We verified select interactions by 
performing a test of reciprocity: By baiting with 
candidate proteins in an antibody capture, RIP- 
qPCR (RNA immunoprecipitation-quantitative 
polymerase chain reaction) of UV-cross-linked cells 
reciprocally identified Xist RNA in the pulldowns 
(Fig. 1D). Called on the basis of high enrichment 
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values, presence of multiple subunits within a 
candidate epigenetic complex, and tests of reci- 
procity, novel high-confidence interactors fell into 
several functional categories: (i) Cohesin complex 
proteins SMCla, SMC3, RAD21, WAPL, and PDS5a/ 
b, as well as CCCTC-binding factor (CTCF) (25), 
which are collectively implicated in chromosome 
looping (26-28); (ii) histone modifiers such as 
aurora kinase B (AURKB), a serine/threonine ki- 
nase that phosphorylates histone H3 (29); RINGI, 
the catalytic subunit of polycomb repressive 
complex 1 (PRC1) for H2A-K119 ubiquitylation 
(23); SPEN and RBM15, which associate with 
histone deacetylases; (iii) switch/sucrose non- 
fermentable (SWI/SNF) chromatin remodeling 
factors; (iv) topoisomerases TOP2a, TOP2b, and 
TOP1, which relieve torsional stress during tran- 
scription and DNA replication; (v) miscellaneous 
transcriptional regulators like MYEF2 and ELAV1; 
(vi) nucleoskeletal proteins that anchor chromo- 
somes to the nuclear envelope, SUN2, lamin-B re- 
ceptor (LBR), and LAP2; (vii) nuclear matrix proteins 
hnRPU/SAF-A, hnRPK, and MATRIN3; and (viii) the 
DNA methyltransferase DNMT1, known as a mainte- 
nance methylase for CpG dinucleotides (30). 

To study their function, we first performed 
RNA immunofluorescence in situ hybridization 
(immunoFISH) of female cells and observed sev- 
eral patterns of Xi coverage relative to the sur- 
rounding nucleoplasm (Fig. IE). Like PRC2, RING1 
(PRC1) has been shown to be enriched on the Xi 
(23) and is therefore not pursued further. TOP1 
and TOP2a/b appeared neither enriched nor de- 
pleted on the Xi (100%, n > 50 nuclei). AURKB 
showed two patterns of localization: pericentric 
enrichment (20%, n > 50) and a more diffuse lo- 
calization pattern (80%, data not shown), consist- 
ent with its cell-cycle-dependent chromosomal 
localization (29). On the other hand, whereas 
SUN2 was depleted on the Xi (100%, n = 52), it 
often appeared as pinpoints around the Xi in 
both day 7 differentiating female ES cells (estab- 
lishment phase; 44%, 7 = 307) and in fibroblasts 
(maintenance phase; 38.5%, n = 52), consistent 
with SUN2’s function in tethering telomeres to 
the nuclear envelope. Finally, the cohesins and 
SWI/SNF remodelers unexpectedly showed a de- 
pletion relative to the surrounding nucleoplasm 
(100%, m = 50 to 100). These patterns suggest 
that the Xist interactors operate in different XCI 
pathways. 

To ask if the factors intersect the PRC2 path- 
way, we stably knocked down (KD) top candidates 
using short-hairpin RNAs (shRNAs) (table S2) and 
performed RNA immunoFISH to examine trimeth- 
ylation of histone H3-lysine 27 (H3K27me3) (Fig. 2, 
A and B). No major changes to Xist localization 
or H3K27me3 were evident in d7 ES cells (fig. S1). 
There were, however, long-term effects in fibro- 
blasts: The decrease in H3K27me3 enrichment in 
shSMARCC1 and shSMARCAS cells (Fig. 2, A and 
B) indicated that SWI/SNF interaction with Xist 
is required for proper maintenance of PRC2 func- 
tion on the Xi. Steady-state Xist levels did not 
change by more than twofold (Fig. 2C) and were 
therefore unlikely to be the cause of the polycomb 
defect. Knockdowns of other factors (cohesins, 
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topoisomerases, SUN2, and AURKB) had no ob- 
vious effects on Xist localization and H3K27me3. 
Thus, whereas the SWI/SNF factors intersect the 
PRC2 pathway, other interactors do not overtly 
affect PRC2. 


Xi reactivation via targeted inhibition of 
synergistic interactors 


Given the large number of interactors, we created 
a screen to analyze effects on Xi gene expression. 
We derived clonal fibroblast lines harboring a 
transgenic green fluorescent protein (GFP) reporter 
on the Xi (fig. $2) and shRNAs against Xist in- 
teractors. Knockdown of any one interactor did 
not reactivate GFP by more than fourfold (Fig. 3A, 
shControl + none, and fig. S3A). Suspecting syn- 
ergistic repression, we targeted multiple pathways 
using a combination of drugs. To target DNMT1, 
we employed the small molecule 5'-azacytidine 
(aza) (30) at a nontoxic concentration of 0.3 uM 
[smedian inhibitory concentration (IC;,)], which 
minimally reactivated GFP (Fig. 3A, shControl + 
aza). To target TOP2a/b (37), we employed eto- 
poside (eto) at 0.3 uM (<IC;9), which also min- 
imally reactivated GFP (Fig. 3A, shControl + eto). 
Combining 0.3 uM aza + eto led to an 80- to 90- 
fold reactivation—a level that was almost half of 
GFP levels on the Xa (Xa-GFP) (Fig. 3A), suggest- 
ing strong synergy between DNMTI1 and TOP2 
inhibitors. Using aza + eto as priming agents, we 
designed triple-drug combinations inclusive of 
shRNAs for proteins that have no specific small- 
molecule inhibitors. In various shRNA + aza + 
eto combinations, we achieved up to 230-fold 
GFP reactivation—levels that equaled or exceeded 
Xa-GFP levels (Fig. 3A). The greatest effects were 
observed for combinations using shSMARCC2 
(227x), SaSMARCA4 (180x), and shRAD21 (211x). 
shTOP1 and shCTCF were also effective (1’75x and 
154x, respectively). Combinations involving re- 
maining interactors yielded 63 to 94x reactivation. 

We then performed allele-specific RNA sequenc- 
ing (RNA-seq) to investigate native Xi genes. In 
an F1 hybrid fibroblast line in which the Xi is of 
Mus musculus (mus) origin and the Xa of Mus 
casteneus (cas) origin, >600,000 X-linked sequence 
polymorphisms enabled allele-specific calls (32). 
Two biological replicates of each of the most 
promising triple-drug treatments showed good 
correlation (figs. S4 to S6). RNA-seq analysis 
showed reactivation of '75 to 100 Xi-specific genes 
in one replicate (Fig. 3B) and up to 200 in a sec- 
ond replicate (fig. S3B), representing a large frac- 
tion of expressed X-linked genes, considering that 
only ~210 X-linked genes have an fragments per 
kilobase of transcript per million mapped reads 
(FPKM) = 1.0 in this hybrid fibroblast line. Heat 
map analysis demonstrated that, for individual 
Xi genes, reactivation levels ranged from 2 to 
80x for various combinatorial treatments (Fig. 
3C). There was a net increase in expression level 
(AFPKM) from the Xi in the triple-drug-treated 
samples relative to the shControl + aza + eto, 
whereas the Xa and autosomes showed no ob- 
vious net increase, thereby suggesting preferen- 
tial effects on the Xi due to targeting synergistic 
components of the Xist interactome. Reactivation 


was not specific to any one Xi region (Fig. 3D). 
Most effective were shRAD21, shSMC3, shSMCla, 
shSMARCA4, shTOP2a, and shAURKB drug com- 
binations. Genic examination confirmed increased 
representation of mus-specific tags (red) relative 
to the shControl (Fig. 3E). Such allelic effects 
were not observed at imprinted loci and other 
autosomal genes (fig. S7), further suggesting 
Xi-specific allelic effects. The set of reactivated 
genes varied among drug treatments, although 
some genes (e.g., Rbbp7, G6pdx, and Fmrl) ap- 
peared more prone to reactivation. Thus, the Xi 
is maintained by multiple synergistic pathways, 
and Xi genes can be reactivated preferentially by 
targeting two or more synergistic Xist interactors. 


Xist interaction leads to cohesin repulsion 


To investigate the mechanism, we focused on 
one group of interactors—the cohesins—because 
they were among the highest-confidence hits 
and their knockdowns consistently destabilized 
Xi repression. To obtain Xa and Xi binding pat- 
terns, we performed allele-specific chromatin im- 
munoprecipitation sequencing (ChIP-seq) for 
two cohesin subunits, SMCla and RAD21, and 
for CTCF, which works together with cohesins 
(28, 833-35). In wild-type cells, CTCF binding was 
enriched on Xa (cas) but also showed a number 
of Xi (mus)-specific sites (Fig. 4A) (25, 36). Allelic 
ratios ranged from equal to nearly complete Xa 
or Xi skewing (Fig. 4A). For the cohesins, 1490 
SMCla and 871 RAD21 binding sites were mapped 
onto the X chromosome in total, of which allelic 
calls could be made on ~50% of sites (Fig. 4, B and 
C). Although the Xa and Xi each showed signi- 
ficant cohesin binding, Xa-specific sites greatly 
outnumbered Xi-specific sites. For SMCla, 717 sites 
were called on Xa, of which 589 were Xa-specific; 
203 sites were called on Xi, of which 20 were Xi- 
specific. For RAD21, 476 sites were called on Xa, of 
which 336 were Xa-specific; 162 sites were called 
on Xi, of which 18 were Xi-specific. Biological 
replicates showed similar trends (fig. S8, A and B). 

Cohesin’s Xa preference was unexpected in 
light of Xist’s physical interaction with cohesins— 
an interaction suggesting that Xist might recruit 
cohesins to the Xi. We therefore conditionally 
ablated Xist from the Xi (Xi****) and repeated 
ChIP-seq analysis in the Xi****/Xa fibroblasts 
(37). Surprisingly, Xi**t acquired 106 SMCla 
and 48 RAD21 sites in cis at positions that were 
previously Xa-specific (Fig. 4, C and D). Biolog- 
ical replicates trended similarly (figs. S8 and S9). 
In nearly all cases, acquired sites represented a 
restoration of Xa sites, rather than binding to 
random positions. By contrast, sites that were 
previously Xi-specific remained intact (Fig. 4, C 
and E, and fig. S8B), suggesting that they do not 
require Xist for their maintenance. The changes 
in cohesin peak densities were Xi-specific and sig- 
nificant (Fig. 4F). Cohesin restoration occurred 
throughout Xi*“**, resulting in domains of bi- 
allelic binding (Fig. 4G and figs. S10 to $12), and 
often favored regions that harbor genes that es- 
cape XCI (e.g., Bgn) (38, 39). There were also shifts 
in CTCF binding, more noticeable at a locus- 
specific level than at a chromosomal level (Fig. 4, 
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Fig. 1. iDRiP-MS reveals a large Xist interactome. (A) iDRiP schematic. 
(B) RT-qgPCR demonstrated the specificity of Xist pulldown by iDRiP. Xist and 
control luciferase probes were used for pulldown from UV-cross-linked female 
and control male fibroblasts. Efficiency of Xist pulldown was calculated by 
comparing to a standard curve generated using 10-fold dilutions of input. Mean + 
standard error (SE) of three independent experiments shown. P determined 
by Student's t test. (C) Selected high-confidence candidates from three 
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biological replicates are grouped into functional classes. Additional candi- 
dates are shown in table Sl. (D) UV-RIP-qPCR validation of candidate in- 
teractors. Enrichment calculated as percentage input, as in (B). Mean + SE of 
three independent experiments shown. P determined by Student's t test. 
(E) RNA immunoFISH to examine localization of candidate interactors 
(green) in relation to Xist RNA (red). Immortalized MEF cells are tetraploid 
and harbor two Xi. 
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A and G), suggesting that CTCF and cohesins do 
not necessarily track together on the Xi. The ob- 
served dynamics were X chromosome-specific 
and were not observed on autosomes (fig. S13). 
To determine whether there were restoration 
hotspots, we plotted restored SMCla and RAD21 
sites (Fig. 4H, purple) on Xi*™*t and observed 
clustering within gene-rich regions. We conclude 
that Xist does not recruit cohesins to the Xi- 
specific sites. Instead, Xist actively repels cohe- 
sins in cis to prevent establishment of the Xa 
pattern. 


Xist RNA directs an Xi-specific 
chromosome conformation 


Cohesins and CTCF have been shown to facili- 
tate formation of large chromosomal domains 
called TADs (topologically associated domains) 
(27, 28, 34, 35, 40-42). The function of TADs is 
currently not understood, because TADs are largely 
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invariant across development. However, X-linked 
domains are exceptions to this rule and are 
therefore compelling models to study function of 
topological structures (43-46). By carrying out 
allele-specific Hi-C, we asked whether cohesin 
restoration altered the chromosomal architec- 
ture of Xi**"*t. First, we observed that, in wild- 
type cells, our TADs called on autosomal contact 
maps at 40-kb resolution resembled published 
composite (nonallelic) maps (27) (Fig. 5A, bottom). 
Our X chromosome contact maps were also con- 
sistent, with TADs being less distinct due to a 
summation of Xa and Xi reads in the composite 
profiles (Fig. 5A, top). Using the 44% of reads 
with allelic information, our allelic analysis 
yielded high-quality contact maps at 100-kb 
resolution by combining replicates (Fig. 5B and 
fig. S14A) or at 200-kb resolution with a single 
replicate. In wild-type cells, we deduced 112 TADs 
at 40-kb resolution on the X chromosome using the 


Xist RNA H3K27me3 


fo) 


shRAD21 
shSMCta 
shSMC3 
shRAD21 
shAURKB 
shTOP2b 


shSMCila ba shSMC3 eB 
Cy bes shAURKB - 


method of Dixon et al. (27). We attempted TAD 
calling for the Xi on the 100-kb contact map but 
were unable to obtain obvious TADs, suggesting 
that the 112 TADs are present only on the Xa. The 
Xi instead appeared to be partitioned into two 
megadomains at the DX7/4 region (fig. S14A) (46). 
Thus, although the Xa is topologically organized 
into structured domains, the Xi is devoid of such 
megabase-scale structures across its full length. 
When Xist was ablated, however, TADs were 
restored in cis, and the Xi reverted to an Xa-like 
conformation (Fig. 5B and fig. S14B). In mutant 
cells, ~30 TADs were gained on Xi**** in each 
biological replicate. Where TADs were restored, 
xi**"*" patterns (red) became nearly identical to 
those of the Xa (blue), with similar interaction 
frequencies. These Xi*™"' regions now bore little 
resemblance to the Xi of wild-type cells (xi¥’, 
orange). Overall, the difference in the average in- 
teraction scores between Xi™7 and Xi****' was 
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Fig. 2. Effect of depleting Xist interactors on H3K27me3. (A) RNA 
immunoFISH of Xist (red) and H3K27me3 (green) after snRNA KD of inter- 
actors in fibroblasts (tetraploid; two Xist clouds). KD efficiencies (fraction 
remaining): SMCla-0.48, SMC3-0.39, RAD21-0.15, AURKB-0.27, TOP2b- 
0.20, TOP2a-0.42, TOP1-0.34, CTCF-0.62, SMARCA4-0.52, SMARCAS5-0.18, 
SMARCC1-0.25, SMARCC2-0.32, SMARCB1-0.52, and SUN2-0.72. Some fac- 
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tors are essential; therefore, high-percentage KD may be inviable. All images 
are presented at the same photographic exposure and contrast. (B) Quan- 
titation of RNA immunoFISH results. n, sample size. Percentages of aberrant 
Xist/H3K27me3 associations are shown. (C) RT-qPCR of Xist levels in KD fi- 
broblasts, normalized to shControls. Means + SD of two independent exper- 
iments are shown. 
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Fig. 3. Derepression of Xi genes by targeting Xist interactors. (A) Relative 
GFP levels by RT-qPCR analysis in female fibroblasts stably knocked down for 
indicated interactors + 0.3 uM 5'-azacytidine (aza) + 0.3 uM etoposide (eto). Xa- 
GFP. control male fibroblasts with X-linked GFP. Means + SE of two independent 
experiments are shown. P determined by Student's t test. (B) Allele-specific RNA- 
seq analysis: Number of up-regulated Xi genes for each indicated triple-drug 
treatment (aza + eto + shRNA). Blue, genes specifically reactivated on Xi [fold 
change (FC) > 2]; red, genes also up-regulated on Xa (FC > 1.3). (©) RNA-seq heat 
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map indicating that a large number of genes on the Xi were reactivated. X-linked 
genes reactivated in at least one of the triple-drug treatment (aza + eto + snRNA) 
were shown in the heat map. Color key, Log2 FC. Cluster analysis performed based 
on similarity of KD profiles (across) and on the sensitivity and selectivity of various 
genes to reactivation (down). (D) Chromosomal locations of Xi reactivated genes 
(colored ticks) for various aza + eto + SHRNA combinations. (E) Read coverage of 
four reactivated Xi genes after triple-drug treatment. Xi, mus reads (scale, O to 2). 
Comp, total reads (scale, O to 6). Red tags appear only in exons with SNPs. 
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highly significant (Fig. 5C and fig. SISA). Inter- 
secting TADs with SMCla sites on Xi**"*' revealed 
that 61 restored cohesin sites overlapped restored 
TADs (61 did not overlap). In general, restored 
cohesin sites occurred both within TADs and at 


TAD borders. TADs overlapping restored peaks 
had larger increases in interaction scores relative 
to all other TADs (Fig. 5D and fig. S15B), and we 
observed an excellent correlation between the 
restored cohesin sites and the restored TADs 


(Fig. 5E and fig. S15C), consistent with a role of 
cohesins in reestablishing TADs after Xist deletion. 
Taken together, these data uncover a role for RNA 
in establishing topological domains of mammalian 
chromosomes and demonstrate that Xist must 
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SMCla or RAD21 peaks. 

Allele-specific skewing is defined as =threefold skew toward either Xa (cas, blue 
dots) or Xi (mus, red dots). Biallelic peaks, gray dots. (C) Table of total, 
Xa-specific, and Xi-specific cohesin binding sites in WT versus AXist (Xi**'7Xa"") 
cells. Significant SMCla and RAD21 allelic peaks with =5 reads were analyzed. 
Allele-specific skewing is defined as =threefold skew toward Xa or Xi. Sites 
were considered “restored” if Xi4*'S"s read counts were >50% of Xa’s. X-total, all 
X-linked binding sites; allelic peaks, sites with allelic information; Xa-total, all Xa 
sites; Xi-total, all sites; Xa-spec, Xa-specific; Xi-spec, Xi-specific; Xi-invariant, 
Xi-specific in both WT and Xi4*"*t7Xa"" cells. There is a net gain of 96 sites on 
the Xi in the mutant, a number different from the number of restored sites 
(106). This difference arises because restored peaks are defined as sites that 
are heavily Xa-skewed in WT but acquire substantial Xi-binding after Xist 
deletion; thus, the number of restored sites is not simply the net change in Xi 
sites. (D) Partial restoration of SMCla or RAD21 peaks on the Xi**"*t to an Xa 
pattern. Plotted are peaks with read counts with =threefold skew toward Xa" 
(Xa-specific). x axis, normalized Xa" read counts; y axis, normalized Xi4*'** 


aab2276-6 17 JULY 2015 + VOL 349 ISSUE 6245 


ee eed i Vr 


read counts; black diagonal, 1:1 Xi****7Xa ratio; red diagonal, 1:2 ratio. (E) Xi- 
specific SMCla or RAD21 peaks remained on Xi4*"**, Plotted are read counts 
for SMCla or RAD21 peaks with >threefold skew toward Xi“ (‘“Xi-specific”). 
(F) Comparison of fold changes for CTCF, RAD21, and SMC1 binding in X4*'st 
cells relative to WT cells. Shown are fold changes for Xi versus Xa. The Xi showed 
significant gains in RAD21 and SMCla binding, but not in CTCF binding. Method: 
XT and x4*‘st ChIP samples were normalized by scaling to equal read counts. 
Fold changes for Xi were computed by dividing the normalized mus read count in 
xdXist by the mus read count X“"; fold changes for Xa were computed by dividing 
the normalized cas read count in X°** by the cas read count X“" To eliminate 
noise, peaks with <10 allelic reads were eliminated from analysis. P values were 
determined by a paired Wilcoxon signed rank test. (G) The representative 
examples of cohesin restoration on Xi**'st. Arrowheads, restored peaks. 
(H) Allelic-specific cohesin binding profiles of Xa, XiM", and Xi**'S*. Shown 
below restored sites are regions of Xi reactivation after shSMCla and shRAD21 
triple-drug treatments, as defined in Fig. 3. 
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Fig. 5. Ablating Xist 
results in Xi rever- 
sion to an Xa-like 
chromosome confor- 
mation. (A) Chr13 and 
X chromosome contact 
maps showing triangu- 
lar domains representa- 
tive of TADs. Purple 
shades correspond to 
varying interaction fre- 
quencies (dark, greater 
interactions). TADs 
called from our 
composite (nonallelic) 
HiC data at 40-kb reso- 
lution (blue bars) are 
highly similar to those 
(gray bars) called previ- 
ously (27). (B) Allele- 
specific HiC-seq 
analysis: Contact maps 
for three different 
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actively and continually repulse cohesins from the 
Xi, even during the maintenance phase, to prevent 
formation of an Xa chromosomal architecture. 


Discussion 


Using iDRiP, we have identified a comprehen- 
sive Xist interactome and revealed multiple syn- 
ergistic pathways to Xi repression (Fig. 6). With 
Xist physically contacting 80 to 250 proteins at 
any given time, the Xist ribonucleoprotein parti- 
cle may be as large as the ribosome. Our study 
supports a model in which Xist RNA simulta- 
neously acts as (i) scaffold for the recruitment of 
repressive complexes (such as PRC1, PRC2, ATRX, 
mH2A, and SmcHDI) to establish and maintain 
the inactive state; and as (ii) a repulsion mecha- 
nism to extrude architectural factors such as 
cohesins to avoid acquisition of a transcription- 
favorable chromatin conformation. Without Xist, 
cohesins return to their default Xa binding state. 
Repulsion could be based on eviction, with Xist 
releasing cohesins as it extrudes them, or on se- 
questration, with Xist sheltering cohesins to pre- 
vent Xi binding. Our study shows that the Xi 
harbors three types of cohesin sites: (i) Xi-specific 
sites that do not depend on Xist; (ii) biallelic 
sites that are also Xist-independent; and (iii) Xa- 
specific sites, many of which cannot be estab- 
lished on the Xi because of active repulsion by 
Xist. The type (i) and type (iii) sites likely explain 
the paradoxical observations that, on the one hand, 
depleting cohesins leads to Xi reactivation but, 
on the other, loss of Xist-mediated cohesin recruit- 
ment leads to an Xa-like chromosome conforma- 
tion that is permissive for transcription. In essence, 
modulating the type (i) and type (iii) sites both 
have the effect of destabilizing the Xi, rendering 
the Xi more accessible to transcription. Disrupt- 
ing type (i) sites by cohesin knockdown would 
change the repressive Xi structure, while ablating 
Xist would restore the type (iii) sites that promote 
an Xa-like conformation. Our study has focused 
on cohesins, but RNA-mediated repulsion may be 
an outcome for other Xist interactors and may be 
as prevalent an epigenetic mechanism as RNA- 
mediated recruitment (47). 

The robustness of Xi silencing is demonstrated 
by the observation that we destabilized the Xi 
only after pharmacologically targeting two or 
three distinct pathways. The fact that the triple- 
drug treatments varied with respect to reacti- 
vated loci and depth of derepression creates the 
possibility of treating X-linked disease in a locus- 
specific manner by administering unique drug 
combinations. Given the existence of many other 
disease-associated IncRNAs, the iDRiP technique 
could be applied systematically toward identify- 
ing new drug targets for other diseases and gen- 
erally for elucidating mechanisms of epigenetic 
regulation by ncRNA. 


Materials and methods 

Identification of Direct RNA interacting 
Proteins (iDRiP) 

Mouse embryonic fibroblasts (MEFs) were irra- 
diated with UV light at 200 mJ energy (Stratagene 
2400) after rinsing with phosphate-buffered 
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saline (PBS). The pellets were resuspended in 
cytoskeleton buffer with 0.5% Triton X-100 
(CSKT)-0.5% [10 mM piperazinediethanesulfonic 
acid, pH 6.8, 100 mM NaCl, 3 mM MgCl, 0.38 M 
sucrose, 0.5% Triton X-100, 1 mM phenylmethyl- 
sulfonyl fluoride (PMSF)] for 10 min at 4°C fol- 
lowed by a spin. The pellets were again resuspended 
in nuclear isolation buffer (10 mM Tris pH 7.5, 
10 mM KCl, 0.5% Nonidet-P 40, 1x protease in- 
hibitors, 1 mM PMSF), and rotated at 4°C for 10 min 
(optional step). The pellets were collected after 
aspin, weighed, flash frozen in liquid nitrogen, 
and stored at -80°C until use. 

Approximately equal amounts of female and 
male UV cross-linked pellets were thawed and 
resuspended for treatment with Turbo DNase I 
in the DNase I digestion buffer (50 mM Tris pH 
7.5, 0.5% Nonidet-P 40, 0.1% sodium lauroyl sar- 
cosine, 1x protease inhibitors, SuperaseIn). The 
tubes were rotated at 37°C for 45 min with inter- 
mittent mixing or vortexing. The nuclear lysates 
were further solubilized by adding 1% sodium 
lauroyl sarcosine, 0.3 M lithium chloride, 25 mM 
EDTA, and 25 mM EGTA to final concentrations. 
After brief vortexing, continue incubation at 37°C 
for 15 min. The lysates were mixed with biotiny- 
lated DNA probes (table S3) prebound to the 
streptavidin magnetic beads (MyOne streptavidin 
C1 Dyna beads, Invitrogen) and incubated at 
55°C for 1 hour before overnight incubation at 
37°C in the hybridization chamber. The beads 
were washed three times in wash buffer (10 mM 
Tris, pH 7.5, 0.3 M LiCl, 1% LDS, 0.5% Nonidet-P 
40, 1x protease inhibitor) at room temperature 
followed by treatment with Turbo DNase I in 
DNase I digestion buffer with the addition of 
0.3 M LiCl, protease inhibitors, and superaseIn at 
37°C for 20 min. Then, beads were resuspended 
and washed two more times in the wash buffer. 
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For MS analysis, elution was done in elution 
buffer (10 mM Tris, pH 7.5, 1 mM EDTA) at 70°C 
for 4 min followed by brief sonication in Covaris. 
For the quantification of pulldown efficiency, 
MEFs, without cross-linking, were used and elu- 
tion was done at 95°C. The elute was used for 
RNA isolation and reverse transcription qPCR 
(RT-qPCR). When cross-linked MEFs were used, 
elute was subjected for proteinase-K treatment 
(50 mM Tris pH 7.5, 100 mM NaCl, 0.5% SDS, 10 
ug proteinase K) for 1 hour at 55°C. RNA was 
isolated by Trizol and quantified with SYBR 
green qPCR. Input samples were used to make 
standard curve by 10-fold dilutions, to which the 
RNA pulldown efficiencies were compared and 
calculated. The efficiency of Xist pulldown was 
relatively lower after UV cross-linking, similar to 
(48, 49). 


Quantitative proteomics 


Proteins co-enriched with Xist from female or 
male cells were quantitatively analyzed either 
using a label-free approach based on spectral 
counting (27) or by multiplexed quantitative pro- 
teomics using tandem-mass tag (TMT) reagents 
(50, 51) on an Orbitrap Fusion mass spectrometer 
(Thermo Scientific). Disulfide bonds were reduced 
with ditheiothreitol (DTT) and free thiols alkylated 
with iodoacetamide as described previously (22). 
Proteins were then precipitated with tricholoracetic 
acid, resuspended in 50 mM HEPES (pH 8.5) and 
1M urea, and digested first with endoproteinase 
Lys-C (Wako) for 17 hours at room temperature 
and then with sequencing-grade trypsin (Promega) 
for 6 hours at 37°C. Peptides were desalted over 
Sep-Pak Cs solid-phase extraction (SPE) cartridges 
(Waters), and the peptide concentration was de- 
termined using a bicinchoninic acid (BCA) assay 
(Thermo Scientific). For the label-free analysis, 
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peptides were then dried and resuspended in 5% 
formic acid (FA) and 5% acetonitrile (ACN), and 
5 ug of peptides were analyzed by mass spec- 
trometry as described below. For the multiplexed 
quantitative analysis, a maximum of 50 ug of pep- 
tides were labeled with one out of the available 
TMT-10plex reagents (Thermo Scientific) (50). 
To achieve this, peptides were dried and resus- 
pended in 50 ul of 200 mM HEPES (pH 8.5) and 
30% (ACN), and 10 pg of the TMT in reagent in 
5 ul of anhydrous ACN was added to the solution, 
which was incubated at room temperature (RT) 
for 1 hour. The reaction was then quenched by 
adding 6 ul of 5% (w/v) hydroxylamine in 200 mM 
HEPES (pH 8.5) and incubated for 15 min at RT. 
The labeled peptide mixture was then subjected 
to a fractionation using basic pH reversed-phase 
liquid chromatography (bRPLC) on an Agilent 
1260 Infinity high-performance liquid chroma- 
tography (HPLC) system equipped with an Agilent 
Extend-C18 column (4.6 x 250 mm; particle size, 
5 um), basically as described previously (52). Pep- 
tides were fractionated using a gradient from 
22 to 35% ACN in 10 mM ammonium bicarbo- 
nate over 58 min at a flow rate of 0.5 ml/min. 
Fractions of 0.3 ml were collected into a 96-well 
plate to then be pooled into a total 12 fractions 
(A1 to A12, B1 to B12, etc.) that were dried and 
resuspended in 8 ul of 5% FA and 5% ACN, 3 of 
which were analyzed by microcapillary liquid 
chromatography tandem mass spectrometry on an 
Orbitrap Fusion mass spectrometer, and a recently 
introduced multistage (MS3) method was used to 
provide highly accurate quantification (53). 

The mass spectrometer was equipped with 
an EASY-nLC 1000 integrated autosampler and 
HPLC pump system. Peptides were separated 
over a 100-m inner diameter microcapillary col- 
umn in-house packed with first 0.5 cm of Magic 
C4 resin (5 um, 100 A, Michrom Bioresources), 
then with 0.5 cm of Maccel Cy resin (3 um, 200 A, 
Nest Group) and 29 cm of GP-C18 resin (1.8 tm, 
120 A, Sepax Technologies). Peptides were eluted 
applying a gradient of 8 to 27% ACN in 0.125% 
formic acid over 60 min (label-free) and 165 min 
(TMT) at a flow rate of 300 nl/min. For label-free 
analyses, we applied a tandem-MS method where 
a full-MS spectrum [MS1; mass-to-charge ratio 
(m/z) 375 to 1500; resolution 6 x 10*; automated 
gain control (AGC) target, 5 x 10°; maximum 
injection time, 100 ms] was acquired using the 
Orbitrap, after which the most abundant peptide 
ions where selected for linear ion trap CID-MS2 
in an automated fashion. MS2 scans were done 
in the linear ion trap using the following set- 
tings: quadrupole isolation at an isolation width 
of 0.5 Th; fragmentation method, CID; AGC tar- 
get, 1 x 10*; maximum injection time, 35 ms; nor- 
malized collision energy, 30%). The number of 
acquired MS2 spectra was defined by setting the 
maximum time of one experimental cycle of MS1 
and MS2 spectra to 3 s (top speed). To identify 
and quantify the TMT-labeled peptides, we ap- 
plied a synchronous precursor selection MS3 meth- 
od (22, 53, 54) in a data-dependent mode. The 
scan sequence was started with the acquisition of 
a full MS or MS1 spectrum acquired in the Orbi- 
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trap (m/z range, 500 to 1200; other parameters 
were set as described above), and the most in- 
tense peptide ions detected in the full MS spec- 
trum were then subjected to MS2 and MS3 analysis, 
while the acquisition time was optimized in an 
automated fashion (top speed, 5 s). MS2 scans 
were performed as described above. Using syn- 
chronous precursor selection, the 10 most abun- 
dant fragment ions were selected for the MS3 
experiment after each MS2 scan. The fragment 
ions were further fragmented using the higher- 
energy collisional dissociation (HCD) fragmentat- 
ion (normalized collision energy, 50%), and the 
MS3 spectrum was acquired in the Orbitrap (re- 
solution, 60,000; AGC target, 5 x 10*; maximum 
injection time, 250 ms). 

Data analysis was performed on an in-house 
generated SEQUEST-based (55) software platform. 
RAW files were converted into the mzXML format 
using a modified version of ReAdW.exe. MS2 
spectra were searched against a protein sequence 
database containing all protein sequences in the 
mouse UniProt database (downloaded 4 February 
2014), as well as that of known contaminants 
such as porcine trypsin. This target component 
of the database was followed by a decoy com- 
ponent containing the same protein sequences 
but in flipped (or reversed) order (56). MS2 spec- 
tra were matched against peptide sequences, with 
both termini consistent with trypsin specificity 
and allowing two missed trypsin cleavages. The 
precursor ion m/z tolerance was set to 50 parts 
per million; TMT tags on the N terminus and on 
lysine residues (229.162932 Da, only for TMT analy- 
ses), as well as carbamidomethylation (57.021464 
Da) on cysteine residues were set as static mod- 
ification; and oxidation (15.994915 Da) of methio- 
nines was set as variable modification. Using the 
target-decoy database search strategy (56), a spec- 
tra assignment false discovery rate (FDR) of less 
than 1% was achieved through using linear dis- 
criminant analysis, with a single discriminant score 
calculated from the following SEQUEST search 
score and peptide sequence properties: mass de- 
viation, XCorr, dCn, number of missed trypsin 
cleavages, and peptide length (57). The probability 
of a peptide assignment to be correct was cal- 
culated using a posterior error histogram, and 
the probabilities for all peptides assigned to a 
protein were combined to filter the data set for 
a protein FDR of less than 1%. Peptides with 
sequences that were contained in more than one 
protein sequence from the UniProt database were 
assigned to the protein with the most matching 
peptides (57). 

For a quantitative estimation of protein con- 
centration using spectral counts, we counted the 
number of MS2 spectra assigned to a given pro- 
tein (table S1). TMT reporter ion intensities were 
extracted as that of the most intense ion within a 
0.03 Th window around the predicted reporter 
ion intensities in the collected MS3 spectra. Only 
MS3 with an average signal-to-noise value of larger 
than 28 per reporter ion, as well as with an iso- 
lation specificity (22) of larger than 0.75, were con- 
sidered for quantification. Reporter ions from all 
peptides assigned to a protein were summed to 


define the protein intensity. A two-step normal- 
ization of the protein TMT intensities was per- 
formed by first normalizing the protein intensities 
over all acquired TMT channels for each protein 
based to the median average protein intensity 
calculated for all proteins. To correct for slight 
mixing errors of the peptide mixture from each 
sample, a median of the normalized intensities 
was calculated from all protein intensities in each 
TMT channel, and the protein intensities were 
normalized to the median value of these median 
intensities. 


UV RIP 


The protocol followed is similar to the one de- 
scribed in (78). Briefly, MEFs were cross-linked 
with UV light at 200 mJ and collected by scrap- 
ing in PBS. Cell pellets were resuspended in 
CSKT-0.5% for 10 min at 4°C followed by a spin. 
The nuclei were resuspended in the UV RIP buf- 
fer [PBS buffer containing 300 mM NaCl (total), 
0.5% Nonidet-P 40, 0.5% sodium deoxycholate, 
200 U Protector RNase Inhibitor and 1x protease 
inhibitors] with Turbo DNase I 30 U/IP for 
30 min at 37°C. Supernatants were collected af- 
ter a spin and incubated with 5 ug specific anti- 
bodies prebound to 40 ul protein-G magnetic 
beads (Invitrogen) at 4°C overnight. Beads were 
washed three times with cold UV RIP buffer. The 
beads were resuspended in 200 ul Turbo DNase I 
buffer with 20 U Turbo DNase, SuperaseIN, 1x 
protease inhibitors for 30 min at 37°C. The beads 
were resuspended and washed three more times 
in the UV RIP washing buffer containing 10 mM 
EDTA. The final three washes were given after 
threefold dilution of UV RIP washing buffer. The 
beads were resuspended in 200 ul proteinase-K 
buffer with 10 ug proteinase-K and incubated at 
55°C for 1 hour. RNA was isolated by Trizol, and 
pulldown efficiencies were calculated by SYBR 
qPCR using input for the standard curve. 


Generation of Xi-TgGFP clonal fibroblasts 


Xi-TgGFP (68-5-11) tail-tip fibroblasts (TTF) were 
initially derived from a single female pup, a 
daughter of a cross between a M. castaneus male 
and a M. musculus female, homozygous for an 
X-linked GFP transgene driven by a strong, ubi- 
quitous promoter (58). The fibroblasts were im- 
mortalized by SV40 transformation, and clonal 
lines were derived from individual GFP-negative 
cells selected by fluorescence-activated cell sort- 
ing. In our experience, occasional clones with 
undetectable GFP expression nevertheless have 
the transgene located on the active X chromosome. 
Thus, we confirmed the GFP transgene location 
on the inactive X for the particular clone used 
here, 68-5-11 (see fig. S2). 


Generation of stable KD of Xi-TgGFP TTF 
and 16.7 ES cells 


The protocol is as described in http://www. 
broadinstitute.org/rnai/public/resources/protocols 

A cocktail of three shRNA viruses was used for 
infections (table S2) followed with puromycin 
selection. In all the experiments, nonclonal knock- 
down cells were used. 
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Assay for the reactivation of Xi-TgGFP 
About 125,000 to 150,000 Xi-TgGFP (68-5-11) cells 
were plated along with control (shControl) cells 
treated with dimethyl sulfoxide or stable KD 
cells treated with 0.3 uM azacytidine and 0.3 1M 
Etoposide for 3 days in six well plates. RNA was 
isolated by Trizol twice, with an intermittent 
TurboDNase treatment after the first isolation 
for 30 min at 37°C. One ug RNA was used for 
each of the RT+ and RT- reactions (Superscript 
III, Invitrogen) followed by the SYBR green 
qPCR using the primers listed in table $3, with 
annealing temperature of 60°C for 45 cycles. The 
relative efficiency of Xi-TgGFP reactivations was 
calculated by comparing to U1 snRNA as the 
internal control. 


ImmunoFISH 


Cells were grown on coverslips, rinsed in PBS, 
pre-extracted in 0.5% CSKT on ice, and washed 
once in CSK, followed by fixation with 4% para- 
formaldehyde in PBS at room temperature. After 
blocking in 1% bovine serum albumin in PBS for 
20 min supplemented with 10 mM ribonucleoside- 
vanadyl complex (VRC) (New England Biolabs) 
and RNase inhibitor (Roche), incubation was 
carried out with primary antibodies (table S3) 
at room temperature for 1 hour. Cells were 
washed three times in PBST-0.02% Tween-20. 
After incubating with secondary antibody at 
room temperature for 30 min, cells were washed 
three times by PBS/0.02% Tween-20. Cells were 
fixed again in 4% paraformaldehyde and dehy- 
drated in ethanol series. RNA FISH was per- 
formed using a pool of Cy3B- or Alexa 568-labeled 
Xist oligonucleotides for 4 to 6 hours at 42°C ina 
humid chamber. Cells were washed three times 
in 2X SSC, and nuclei were counter-stained by 
Hoechst 33342. Cells were observed under Nikon 
90i microscope equipped with 60X/1.4 N.A. ob- 
jective lens, Orca ER charge-coupled device camera 
(Hamamatsu), and Volocity software (Perkin Elmer). 
Xist RNA FISH probes, a set of total 37 oligo- 
nucleotides with 5’ amine modification (IDT), were 
labeled with NHS-Cy3B (GE Healthcare) overnight 
at room temperature followed by ethanol precip- 
itation. In the case of confirmation of Xi-TgGFP 
cells, probes were made by nick translation of a 
GFP PCR product with Cy3-dUTP and of a plas- 
mid containing the first exon of the mouse Xist 
gene, with FITC-dUTP. 


Allelic ChIP-seq 


Allele-specific ChIP-seq was performed accord- 
ing to the method of Kung et al. (25), in two 
biological replicates. To increase available read 
depth, we pooled together two technical repli- 
cates for Xi**“*'/xa"" Rad21 replicate 1 sequenced 
on a 2 x 50 bp HiSeq. 2500 rapid run, and we 
also pooled two technical replicates of wild-type 
Rad21 replicate 1, one sequenced on a HiSeq. 2 x 
50 bp run and one on a MiSeq. 2.x 50 bp run. All 
other libraries were sequenced on using 2 x 50 bp 
HiSeq. 2500 rapid runs. To visualize ChIP bind- 
ing signal, we generated fragments per million 
(fpm)-normalized bigWig files from the raw ChIP 
read counts for all reads (comp), mus-specific 
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(mus), and cas-specific reads separately. For Smcla, 
CTCF, and Rad21, peaks were called using macs2 
with default settings. To generate consensus peak 
sets for all three epitopes, peaks for the two wild- 
type and Xi*/Xa" replicates were pooled, and 
peaks present in at least two experiments were 
used as the common peak set. To make compar- 
isons between allelic read counts between differ- 
ent experiments, we defined a scaling factor as 
the ratio of the total read numbers for the two 
experiments and multiplied the allelic reads for 
each peak in the larger sample by the scaling fac- 
tor. We plotted the number of reads on Xi vs Xa in 
wild-type for all peaks on the X-chromosome to 
determine whether there is a general bias toward 
binding to the Xa or the Xi. To evaluate allelic 
skew on an autosome, we generated plots of mus 
read counts versus cas read counts for all peaks on 
chromosome 5 from 1 to 140,000,000. We used 
this particular region of chromosome 5 because 
Xi*/Xq" is not fully hybrid, and this is a large 
region of an autosome that is fully hybrid based 
on even numbers of read counts from input and 
from our Hi-Cs over this region in Xi***/xXa"" 
(data not shown). To identify peaks that are 
highly Xa-skewed in wild-type but bind substan- 
tially to the Xi in Xi*“/xa" (restored peaks), for 
Xa-skewed peaks in wild-type, we plotted nor- 
malized read counts on Xi in xi xa? versus 
read counts on Xa in wild-type. We defined 
restored peaks as peaks that are (i) more than 3X 
Xa-skewed in wild-type, (ii) have at least five 
allelic reads in wild-type, and (iii) exhibit nor- 
malized read counts on Xi in Xi*/Xa"7 that 
are at least half the level of Xa in wild-type. This 
threshold ensures that all restored peaks have at 
least a 2X increase in binding to the Xi in xis) 
Xa" relative to wild-type. We identified restored 
peaks using these criteria in both replicates of 
Smcla and Rad21 ChIP separately, and to merge 
these calls into a consensus set for each epitope, 
we took all peaks that met criteria for restoration 
in at least one replicate and had at least 50% 
wild-type Xa read counts on Xi in Xi*™*/Xa"7 
in both replicates. 


Allele-specific RNA-seq 


Xi-TgGFP TTFs (68-5-11) with the stable knock- 
down of candidates were treated with 5'-azacytidine 
and etoposide at 0.3 uM each for 3 days. Strand- 
specific RNA-seq, the library preparation, deep 
sequencing, and data analysis was followed as 
described in (59). Two biological replicates of 
each drug treatment were produced. All libraries 
were sequenced with Illumina HisEq. 2000 or 
2500 using 50 cycles to obtain paired-end reads. 
To determine the allelic origin of each sequenc- 
ing read from the hybrid cells, reads were first 
depleted of adaptors dimers and PCR duplicates, 
followed by the alignment to custom mus/129 and 
cas genomes to separate mus and cas reads. After 
removal of PCR duplicates, ~90% of reads were 
mappable. Discordant pairs and multimapped 
reads were discarded. Reads were then mapped 
back to reference mm9 genome using Tophat 
v2.0.10 (-g 1-no-coverage-search-read-edit-dist 
3-read-mismatches 3-read-gap-length 3-b2-very- 


sensitive-mate-inner-dist 50-mate-std-dev 50- 
library-type fr-firststrand), as previously described 
(25, 32, 59). After alignment, gene expression 
levels within each library were quantified using 
Homer v4.7 (rna mm9 -count genes -strand + 
-noadj -condenseGenes) (59), and the normalized 
differential expression analyses across samples 
were performed by using EdgeR (60). 


Hi-C library preparation and analysis 


Hi-C libraries were generated according to the 
protocol in Lieberman-Aiden et al. (61). Two bio- 
logical replicate libraries were prepared for 
wild-type and Xi*“*'/XaT fibroblasts each. We 
obtained 150 to 220 million 2 x 50 bp paired-end 
reads per library. The individual ends of the 
read-pairs were aligned to the mus and cas refe- 
rence genomes separately using novoalign with 
default parameters for single-end alignments, and 
the quality score of the alignment was used to 
determine whether each end could be assigned 
to either the mus or the cas haplotype (62). The 
single-end alignments were merged into a Hi-C 
summary file using custom scripts. Reads were 
filtered for self-ligation events and short fragments 
(less than 1.5X the estimated insert length) likely 
to be random shears using Homer (59, 63). Hi-C 
contact maps were generated using Homer. “Comp” 
maps were made from all reads. “Xi” and “Xa” 
reads were from reads where at least one read- 
end could be assigned to either the mus or cas 
haplotype, respectively. A small fraction of reads 
(~5% of all allelic reads) aligned such that one end 
aligned to mus, the other to cas. These “discor- 
dant” reads were excluded from further analysis, 
because they are likely to be noise arising due to 
random ligation events and/or improper single- 
nucleotide polymorphism (SNP) annotation 
(46, 64). All contact maps were normalized using 
the matrix balancing algorithm of Knight and 
Ruiz (65), similar to iterative correction (46, 66), 
using the MATLAB script provided at the end of 
their paper. We were able to generate robust con- 
tact maps using the comp reads in one replicate 
at 40-kb resolution, but because only ~44% of 
reads align allele-specifically, we were only able 
to generate contact maps for the cas and mus 
haplotypes at 200 kb. To increase our resolution, 
we pooled together both biological replicates and 
analyzed the comp contact map at 40-kb reso- 
lution and the mus and cas contact maps at 100 kb. 
We called TADs at 40 kb on the X chromosome 
Chr5, and Chr13 using the method of Dixon e¢ al. 
(27). Specifically, we processed the normalized 
comp 40-kb contact maps separately into a vec- 
tor of directionality indices using DI_from_matrix. 
pl with a bin size of 40,000 and a window size of 
200,000. We used this vector of directionality 
indices as input for the HMM_calls.m script, and 
after HMM_ generation, we processed the HMM 
and generated TAD calls by passing the HMM 
output to file_ends_cleaner.pl, converter_7col.pl, 
hmm_probablity_correcter.pl, hmm-state_caller. 
pl, and, finally, hmm-state_domains.pl. We used 
parameters of min = 2, prob = 0.99, binsize = 
40,000 as input to the HMM probability correc- 
tion script. 
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To create a general metric describing interac- 
tion frequencies within TADs at resolution avail- 
able in the allele-specific interaction maps, for 
each TAD, on the X chromosome and Chr5 we 
averaged the normalized interaction scores for 
all bins within each TAD, excluding the main 
diagonal. To make comparisons between inter- 
action frequency over TADs between the cas (Xa) 
and mus (Xi) haplotypes at the resolution avail- 
able with our current sequencing depth, we de- 
fend the “fraction mus” as the average interaction 
score for a TAD in the mus contact map divided 
by the sum of the average interaction scores in the 
mus and cas contact maps. 

To discover TADs that show significantly in- 
creased interaction frequency in Xi*“"*/xa"", we 
generated a null distribution of changes in aver- 
age normalized interaction scores for all TADs 
on chromosome 5, 1 to 140 Mb using the cas and 
mus contact maps. We reasoned that there would 
be few changes in interaction frequency on an 
autosome between the mus or cas contact maps 
for wild-type and Xi***'/Xa7; thus, the distri- 
bution of fold changes in interaction score on an 
autosome constitutes a null distribution. Using 
this distribution of fold changes allowed us to 
calculate a threshold fold change for an empir- 
ical FDR of 0.05, and all TADs that had a greater 
increase in average normalized interaction score 
on Xi between wild-type and Xi*“*'/Xa"7 were 
considered restored TADs. We performed this 
analysis of restored TADs separately in each bio- 
logical replicate using the 200-kb contact maps 
to generate interaction scores over TADs and using 
the combined data at 100-kb resolution. 
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MAGNETISM 


Blowing magnetic skyrmion bubbles 


Wanjun Jiang,’ Pramey Upadhyaya,” Wei Zhang,’ Guogiang Yu,” 
M. Benjamin Jungfleisch,’ Frank Y. Fradin,’ John E. Pearson,’ Yaroslav Tserkovnyak,* 
Kang L. Wang,” Olle Heinonen,’*’*** Suzanne G. E. te Velthuis,’ Axel Hoffmann’* 


The formation of soap bubbles from thin films is accompanied by topological transitions. 
Here we show how a magnetic topological structure, a skyrmion bubble, can be generated 
in a solid-state system in a similar manner. Using an inhomogeneous in-plane current 

in a system with broken inversion symmetry, we experimentally “blow” magnetic skyrmion 
bubbles from a geometrical constriction. The presence of a spatially divergent spin-orbit 
torque gives rise to instabilities of the magnetic domain structures that are reminiscent of 
Rayleigh-Plateau instabilities in fluid flows. We determine a phase diagram for skyrmion 
formation and reveal the efficient manipulation of these dynamically created skyrmions, 
including depinning and motion. The demonstrated current-driven transformation from stripe 
domains to magnetic skyrmion bubbles could lead to progress in skyrmion-based spintronics. 


agnetic skyrmions are topological spin 

textures that can be stabilized by Dzyalo- 

shinskii-Moriya interactions (DMIs) (/-9) 

in chiral bulk magnets such as MnSi, 

FeGe, etc. Owing to their distinct vortex- 
like spin texture, skyrmions exhibit many fascinating 
features, including emergent electromagnetic 
fields, which enable their efficient manipulation 
(4, 5, 8-10). A particularly technologically inte- 
resting property is that skyrmions can be driven 
by a spin-transfer torque mechanism at a very 
low current density, which has been demonstrated 
at cryogenic temperatures (4, 5, 8, 10). Besides bulk 
chiral magnetic interactions, the interfacial sym- 
metry breaking in heavy metal/ultrathin ferro- 
magnet/insulator (HM/F/I) trilayers introduces 
an interfacial DMI (17-14) between neighboring 
atomic spins, which stabilizes Néel walls (cycloi- 
dal rotation of the magnetization direction) with 
a fixed chirality over the Bloch walls (spiral rota- 
tion of the magnetization direction) (15-20). This 
is expected to result in the formation of skyrmions 
with a “hedgehog” configuration (/4, 18, 21-25). 
This commonly accessible material system ex- 
hibits spin Hall effects from heavy metals with 
strong spin-orbit interactions (26), which in turn 
give rise to well-defined spin-orbit torques (SOTs) 
(7, 19, 27-29) that can control magnetization dy- 
namics efficiently. However, it has been experi- 
mentally challenging to use the electric current 
and/or its induced SOTs (8, 21, 23, 24, 27, 30-32) 
for dynamically creating and/or manipulating 
hedgehog skyrmions. Here we address that issue. 
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Central to this work is how electric currents 
can manipulate a chiral magnetic domain wall 
(DW); that is, the chirality of the magnetization 
rotation (as shown in Fig. 1A) is identical for 
every DW. This fixed chirality is stabilized by 
the interfacial DMI (17-19, 21, 28). In HM/F/I 
heterostructures, the current flowing through 
the heavy metal generates a transverse vertical 
spin current attributable to the spin Hall effect 
(27), which results in spin accumulation at the 
interface with the ferromagnetic layer. This spin 
accumulation gives rise to a SOT acting on the 
chiral DW (Fig. 1A). The resultant effective spin 
Hall field can be expressed as (17-19, 27) 


Ban = BS, bn x (& x Je)] (1) 


where m is the magnetization unit vector, 2 is 
the unit vector normal to the film plane, and j, 
is the direction of electron particle flux. Here BS, 
can be written as (//2\e|) - (OnJe/tM;), where 
h/2 is the spin of an electron (and h is Planck’s 
constant h divided by 27), e is the charge of an 
electron, ¢; is the thickness of the ferromagnetic 
layer, and MW, is the saturation (volume) magneti- 
zation. The spin Hall angle 0. = Js/Je is de- 
fined by the ratio between spin current density 
(J,) and charge current density (J.). For homo- 
geneous current flow along the x axis (Fig. 1B), a 
chiral SOT enables efficient DW motion (17-19). 
In the case of a stripe domain with a chiral DW 
(Fig. 1B), the symmetry of Eq. 1 leads to a vanish- 
ing torque on the side walls parallel to the cur- 
rent, and therefore only the end of the stripe 
domain is moved. If the opposite end is pinned, 
this results in an elongation of the stripe. 

The situation becomes more complex when the 
stripe domain is subjected to an inhomogeneous 
current flow. This can be achieved by introducing 
a geometrical constriction into a current-carrying 
trilayer wire (Fig. 1C). Such a constriction results 
in an additional current component along the y 
axis: jy around the narrow neck (Fig. 1D). The total 
current j is spatially convergent (or divergent) to 
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Fig. 1. Schematic of the transformation of stripe 
domains into magnetic skyrmion bubbles. (A) 
Infinitesimal section of a chiral DW in a ferromagnet 
(F)/heavy metal (HM) bilayer illustrating the relation 
between local magnetization vectors and the SOT- 
induced chiral DW motion of velocity Vay, in a device 
with a homogeneous electron current flow j. along 
the +x axis. Blue corresponds to upward orientation 
of magnetization, whereas orange represents the down- 
ward orientation of magnetization. The bottom panel 
illustrates the magnetization directions inside of the 
Néel wall. (B) Top view of a trilayer device. The blue 
region is a stripe-shaped domain. Light blue arrows 
show the in-plane magnetization direction of the DW 
[as shown in the bottom panel of (A)] and indicate 
that the domain has left-handed chirality. The red 
arrows correspond to the current distribution. (C) 
Introducing a geometrical constriction into the de- 
vice gives rise to an inhomogeneous current distrib- 
ution, which generates a flow along the y axis (jy) 
around the narrow neck. This current distribution is 
spatially divergent to the right and convergent to 
the left of the constriction. The y component of the 
current distribution is highlighted in (D). This intro- 
duces an effective spin Hall force (FY,) along the 
y axis that (E) locally expands the stripe domain on 
the right side. (F) Once the expansion approaches 
a critical point, the resultant restoring forces (Fres) 
associated with the surface tension of the DWs are 
no longer able to maintain the shape, and the stripe 
domains break into circular bubble domains, result- 
ing in the formation of synthetic Néel skyrmions. 
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the left (or right) of the constriction (33). Conse- 
quently, inhomogeneous effective forces (F%,) on 
the DWs (caused by the spin Hall field) are created 
along the y axis; these forces act to expand the end 
of the domain (Fig. IE). As the domain end contin- 
ually expands its radius, the surface tension in 
the DW (resulting from the increasing DW energy 
determined by the combination of exchange and 
anisotropy fields) increases (34), which results in 
breaking the stripes into circular domains (Fig. 1F). 

This process resembles how soap bubbles de- 
velop out of soap films upon blowing air through 
a straw, or how liquid droplets form in fluid flow 
jets (35). Because of the interfacial DMI in the 
present system, the spin structures of the newly 
formed circular domains maintain a well-defined 
deft-handed) chirality (13, 14, 23, 24). Once formed, 
these created synthetic hedgehog (Néel) skyrmions 
(14, 23) are stable due to topological protection 
and move very efficiently following the current 
direction, a process that can be described on the 
basis of a modified Thiele equation (36). The dy- 
namic skyrmion conversion could, in principle, 
happen at the other side of the device, where the 
spatially convergent current compresses stripe 
domains. However, sizeable currents and SOTs are 
required to compensate the enhanced (repulsive) 
dipolar interaction. The proposed mechanism dif- 
fers from a recent theoretical proposal with sim- 
ilar geometry, in which skyrmions are formed 
from the coalescence of two independent DWs 
extending over the full width of a narrow constric- 
tion at a current density ~10° A/cm? (32). For 
repeated skyrmion generation, this latter mech- 
anism requires a continuous generation of paired 
DWs in the constriction, which is inconsistent with 
the experimental observations described below. 


Transforming chiral stripe domains 
into skyrmions 


We demonstrated this idea experimentally with 
a Ta(S5 nm)/Co9F €g9Bo9(CoFeB\(L.1 nm)/TaO,(3 nm) 
trilayer grown by magnetron sputtering (37, 38) 
and patterned into constricted wires via photo- 
lithography and ion milling (33). The wires have 
a width of 60 xm with a 3-.m-wide and 20-um-long 
geometrical constriction in the center. Our devices 
are symmetrically designed across the narrow neck 
to maintain balanced demagnetization energy. A 
polar magneto-optical Kerr effect (MOKE) micro- 
scope in a differential mode (39) was used for dy- 
namic imaging experiments at room temperature. 
Before applying a current, the sample was first sat- 
urated at positive magnetic fields and subsequently 
a perpendicular magnetic field of B, = +0.5 mT 
was applied; sparse magnetic stripe and bubble 
domains prevail at both sides of the wire (Fig. 2A). 
The lighter area corresponds to positive perpendic- 
ular magnetization orientation, and the darker area 
corresponds to negative orientation, respectively. 

In contrast to the initial magnetic domain config- 
uration, after passing a 1-s single pulse of current den- 
sity j. = +5 x 10° A/cm? (normalized by the width of 
the device: 60 tm), it is observed that the stripe do- 
mains started to migrate, subsequently forming 
extended stripe domains on the left side. These do- 
mains were mostly aligned with the charge current 
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flow and converged at the left side of constriction. 
The stripes were transformed into skyrmion bubbles 
immediately after passing through the constric- 
tion (Fig. 2B). These dynamically created skyrmions, 
varying in size between 700 nm and 2 um (depend- 
ing on the strength of the external magnetic field), 
are stable and do not decay on the scale of a typical 
laboratory testing period (at least 8 hours). The size 
of the skyrmions is determined by the interplay be- 
tween Zeeman, magnetostatic interaction and 
interfacial DMI. In the presence of a constant 
electron current density of j. = +5 x 10° A/cm’, 
these skyrmions are created with a high speed 
close to the central constriction and destroyed at 
the end of the wire. Capturing the transformation 
dynamics of skyrmions from stripe domains is 
beyond the temporal resolution of the present set- 
up. Reproducible generation of skyrmions is dem- 
onstrated by repeating pulsed experiments several 
times (33). The left side of the device remains mainly 
in the labyrinthine stripe domain state after remov- 
ing the pulse current, which indicates that both 
skyrmion bubbles and stripe domains are metastable. 

When the polarity of the charge current is re- 
versed to je = -5 x 10° A/cm”, the skyrmions are 
formed at the left side of the device (Fig. 2, C and 
D). This directional dependence indicates that the 
spatially divergent current and SOT, determined 
by the geometry of the device, are most likely 
responsible for slicing stripe DWs into magnetic 
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skyrmion bubbles, qualitatively consistent with the 
schematic presented in Fig. 1. 

At a negative magnetic field B, = -0.5 mT and 
current j. = +5 x 10° A/cm? (Fig. 2, E and F), a 
reversed contrast, resulting from opposite inner 
and outer magnetization orientations, is observed 
compared with positive fields. We varied the exter- 
nal magnetic field and charge current density sys- 
tematically and determined the phase diagram for 
skyrmion formation shown in Fig. 2G. A large pop- 
ulation of synthetic skyrmions is found only in the 
shadowed region, whereas in the rest of phase 
diagram the initial domain configurations remain 
either stationary or flowing smoothly, depending 
on the strength of current density, as discussed 
below. This phase diagram is independent of pulse 
duration for pulses longer than 1 us. No creation of 
skyrmions in a regular-shaped device with a homo- 
geneous current flow (as illustrated in Fig. 1B) is ob- 
served up to a current density of j, = +5 x 10° A/cm”. 


Capturing the transformational dynamics 


The conversion from chiral stripe domains into 
magnetic skyrmions can be captured by decreasing 
the driving current, which slows down the trans- 
formational dynamics. Figure 3, A to D, shows the 
dynamics for a constant current density of j. = 
+6.4 x 10" A/cm? at B, = +0.46 mT. The origi- 
nal (disordered) labyrinthine domains on the left 
side squeeze to pass through the constriction (Fig. 
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Fig. 2. Experimental generation of magnetic skyrmions. (A) Sparse irregular domain structures are 


observed at both sides of the device at a perpendicular magnetic field of B, = 


+0.5 mT. (B) Upon passing a 


current of j. = +5 x 10° A/cm? through the device, the left side of the device develops predominantly 
elongated stripe domains, whereas the right side converts into dense skyrmion bubbles. (C and D) By re- 
versing the current direction to j. = —5 x 10° A/cm®, the dynamically created skyrmions are forming at the left 
side of the device. (E and F) Changing the polarity of the external magnetic field reverses the internal and 
external magnetization of these skyrmions. (G) Phase diagram for skyrmion formation. The shaded area in- 
dicates field-current combinations that result in the persistent generation of skyrmions after each current pulse. 
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3B). The stripe domains become unstable after 
passing through the constriction and are even- 
tually converted into skyrmions on the right side 
of the device, as shown in Fig. 3, C and D. This 
can be seen in more detail in the supplementary 
MOKE movies (movies S1 and S2). Because the x 
component of the current results in an efficient 
motion of DWs, the skyrmion formation can 
occur away from the constriction. The synthetic 
skyrmions do not merge into stripe domains and, 
in fact, repel each other, indicating their topological 
protection as well as magnetostatic interactions. 

Some important features should be noticed. 
There exists a threshold current j.... = +6 x 
10* A/cm? for persistently generating skyrmion 
bubbles from stripe domains for pulses longer than 
lus. Above this current, the enhanced spin-orbit 
torques produce the instability of the DWs, which 
results in the continuous formation of skyrmions. 
The present geometry for skyrmion generation is 
very efficient, resulting in the observed threshold 
current that is three orders of magnitude smaller 
than suggested by previous simulation studies 


Fig. 3. Capturing the transfor- 
mational dynamics from stripe 
domains to skyrmions and 
motion of skyrmions. (A to D) 
At a constant current density je = 
+6.4 x 10% A/cm? and 

B, =+0.46 mT, the disordered 
stripe domains are forced to 
pass through the constriction 
and are eventually converted 
into skyrmions at the right side 
of the device. Red circles 
highlight the resultant newly 
formed skyrmions. (E) Illustra- 
tion of the effective spin Hall 
field acting on these dynami- 
cally created skyrmions; the 
direction of motion follows the 
electron current. (F to I) Effi- 
cient motion of these skyrmions 
for a current density j, = +3 x 

10* A/cm®. (F) First, a 1-s-long 
single pulse je = +5 x 10° A/cm? 
initializes the skyrmion state. (G to 
) Subsequently, smaller currents 
below the threshold current to 
avoid generating additional skyr- 
mions through the constriction) are 
used to probe the current-velocity 
relation. These skyrmions are 
migrating stochastically and moving 
out of the field of view. See supple- 
mentary MOKE movies S1, S2, and 
S4 for the corresponding temporal 
dynamics. (J) The current-velocity 
dependence of skyrmions is ac- 
quired by studying =20 skyrmions 
via averaging their velocities by 
dividing the total displacement 
with the total time period. 
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(10’ to 10° A/cm?) in MnSi thin films with a bulk 
DMI where the driving mechanism is the conven- 
tional spin-transfer torque (30). Below this threshold 
for continuous skyrmion generation, there is a 
threshold depinning current jo. = +41 x 10* A/em? 
that produces a steady motion of stripe domains. 
The force (pressure) on the stripe from SOT at 
this current exceeds the one required to main- 
tain its shape. When jest < Je < Je-sk, the stripe 
domains are moving smoothly through the con- 
striction and prevail at both sides of devices, 
with just the occasional formation of skyrmions. 


Depinning and motion of synthesized 

S = 1 skyrmions 

The magnetic skyrmion bubbles discussed so far 
have a topological charge given by the skyrmion 
number S = 1, as is determined by wrapping the 
unit magnetization vector (m) over the sphere 
S= V4q [im - (0pm x Oym)daxdy (1, 8). These S = 
1 synthetic skyrmions move because of the op- 
posite direction of effective SOTs on the oppo- 
site sides of the skyrmion (Fig. 3E). Following the 
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initialization by a current pulse j. = +5 x 10° A/cm? 
(which is larger than the threshold current j..., for 
generating skyrmions), we studied the efficient 
depinning and motion of synthetic skyrmions 
(Fig. 3, F to I) at B, = —0.5 mT. At the current den- 
sity j. = +3 x 10* A/cm”, there is no migration of 
stripe domains through the constriction (hence an 
absence of newly formed skyrmions). However, it 
is clear that the previously generated skyrmions 
at the right side of the device are gradually moving 
away following the electron flow direction. During 
the motion, no measurable distortion of these syn- 
thetic skyrmions is observed within the experi- 
mental resolution, consistent with the well-defined 
chirality of the skyrmion bubble. The average ve- 
locity (0 = £/At) is determined by dividing the dis- 
placement (/) with the total time period (At). For 
the present current density, the motion of synthetic 
skyrmion is stochastic and influenced by random 
pinning with an average velocity of ~10 m/s, the 
current dependence of which is summarized in 
Fig. 3J. The ratio of the velocity to the applied 
current is comparable to what is observed for the 
chiral DW motion in the related systems (17, 19). 


Current characteristics of S = O 
magnetic bubbles 


Because of the competition between long-range 
dipolar and short-range exchange interaction, a 
system with a weak perpendicular magnetic an- 
isotropy undergoes a spin reorientation transition 
with in-plane magnetic fields that is typified by a 
stripe-to-bubble domain phase transition (39, 40). 
Such an in-plane field-induced bubble state is estab- 
lished by sweeping the magnetic field from Bi = 
+100 to +10 mT. Current-driven characteristics 
of the in-plane field-induced magnetic bubbles are 
in stark contrast to the mobile magnetic skyrmions 
generated from SOTs. These bubbles shrink and 
vanish in the presence of a positive electron current 
density (Fig. 4, A to E) or elongate and transform 
into stripe domains in the presence of negative elec- 
tron current density (Fig. 4, F to J). Such a distinct 
difference directly indicates the different spin struc- 
tures surrounding these field-induced bubbles and, 
therefore, the different skyrmion numbers. 

For the in-plane field-induced magnetic bub- 
bles, because the spin structures of DWs follow 
the external magnetic fields (18, 41, 42) (Fig. 4K), 
the corresponding skyrmion number is S = 0. Be- 
cause of the same direction of the effective spin 
Hall fields given by the reversed DW orientations, 
topologically trivial S = 0 magnetic bubbles expe- 
rience opposite forces on the DWs at opposite ends. 
This leads to either a shrinking or elongation of the 
bubbles, depending on the direction of currents, 
which is consistent with our experimental obser- 
vation. This also explains the in-plane current- 
induced perpendicular magnetization switching 
in the presence of in-plane fields (27, 42). 


Perspectives 


Recent experimental efforts toward creating in- 
dividual magnetic skyrmions use either tunneling 
current from a low-temperature spin-polarized 
scanning tunneling microscope (43) or geometri- 
cal confinement via sophisticated nanopatterning 
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Fig. 4. Absence of motion for the in-plane magnetic fields stabilized S = O magnetic bubbles. (A to E) 
(A) In-plane magnetic field-induced bubbles are created by first saturating at in-plane field By = +100 mT and 
subsequently decreasing to B) = +10 mT. Depending on the direction of the current, these magnetic bubbles 
either shrink or expand. (A to E) Shrinking bubbles are observed upon increasing the current density from j. = +5 x 
10* to +2.5 x 10° A/cm’ in steps of 5 x 10* A/cm®. (F to J) Expansion of bubbles is revealed for currents from je = 
-0.5 x 10° to -2.5 x 10° A/cm? in steps of 5 x 10 A/cm. (K) These results are linked to the different spin textures 
(namely, S = 0 skyrmion bubbles) that were stabilized along the DW by the in-plane magnetic fields, which lead to 
different orientations of the effective spin Hall fields and different directions of DW motion, as illustrated. 


(44-46). Our results demonstrate that spatially di- 
vergent current-induced SOTs can be an effective 
way for dynamically generating mobile magnetic 
skyrmions at room temperature in commonly ac- 
cessible material systems. The size of these syn- 
thetic skyrmions could be scaled down by properly 
engineering the material-specific parameters that 
control the various competing interactions in mag- 
netic nanostructures (23, 24, 47). We expect that 
similar instabilities will be generated from diver- 
gent charge current flows. Whereas the mechanism 
for synthetic skyrmion generation can be qualita- 
tively linked to the spatially divergent spin Hall spin 
torque, a comprehensive understanding of this dy- 
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namical conversion, particularly at the picosecond 
or nanosecond time scale where the intriguing mag- 
netization dynamics occurs, requires further exper- 
imental and theoretical investigations. Spatially 
divergent SOT-driven structures also offer a readily 
accessible model system for studying topological 
transitions and complex flow instabilities (35), where 
the parameters governing the flow, such as surface 
tension, can be systematically tuned by the mag- 
netic interactions. At the same time, this dynamic 
approach for skyrmion generation in the near future 
could enable the demonstration of advanced skyr- 
mionic device concepts; for example, functional 
skyrmion racetrack memory (/4, 23, 36, 48). 
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HEAVY FERMIONS 


Unconventional Fermi surface in an 


insulating state 


B. S. Tan," Y.-T. Hsu,’ B. Zeng,” M. Ciomaga Hatnean,” N. Harrison,* Z. Zhu,* 
M. Hartstein,’ M. Kiourlappou,’ A. Srivastava,’ M. D. Johannes,” T. P. Murphy,” 
J.-H. Park,” L. Balicas,” G. G. Lonzarich,’ G. Balakrishnan,” Suchitra E. Sebastian’* 


Insulators occur in more than one guise; a recent finding was a class of topological 
insulators, which host a conducting surface juxtaposed with an insulating bulk. Here, 

we report the observation of an unusual insulating state with an electrically insulating 
bulk that simultaneously yields bulk quantum oscillations with characteristics of an 
unconventional Fermi liquid. We present quantum oscillation measurements of magnetic 
torque in high-purity single crystals of the Kondo insulator SmBe, which reveal quantum 
oscillation frequencies characteristic of a large three-dimensional conduction electron 
Fermi surface similar to the metallic rare earth hexaborides such as PrBe and LaBg. The 
quantum oscillation amplitude strongly increases at low temperatures, appearing strikingly 
at variance with conventional metallic behavior. 


ondo insulators, a class of materials posi- 
tioned close to the border between insu- 
lating and metallic behavior, provide fertile 
ground for unusual physics (/-14). This 
class of strongly correlated materials is 
thought to be characterized by a ground state 
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with a small energy gap at the Fermi energy 
owing to the collective hybridization of conduc- 
tion and electrons. The observation of quantum 
oscillations has traditionally been associated with 
a Fermi liquid state; here, we present the sur- 
prising measurement of quantum oscillations in 
the Kondo insulator SmBg (15) that originate from 
a large three-dimensional Fermi surface occupy- 
ing half the Brillouin zone and strongly resem- 
bling the conduction electron Fermi surface in 
the metallic rare earth hexaborides (6, 17). Our 
measurements in SmBg reveal a dramatic depar- 
ture from conventional metallic Lifshitz-Kosevich 
behavior (18); instead of the expected satura- 
tion at low temperatures, a striking increase is 


observed in the quantum oscillation amplitude 
at low temperatures. 

Single crystals of SmB, used in the present 
study were grown by means of the image furnace 
technique (79) in order to achieve high purities 
as characterized by the high inverse residual re- 
sistivity ratio. Single crystals with inverse resist- 
ance ratios [IRR = R(T = 1.8 K)/R(T = 300 K), where 
Ris resistance and T is temperature] of the order 
of 10° were selected for this study; the IRR has 
been shown to characterize crystal quality, with 
the introduction of point defects through radia- 
tion damage (20) or through off-stoichiometry 
(21), resulting in a decrease in low-temperature 
resistance and an increase in high-temperature 
resistance. The resistance of a SmBg single crys- 
tal is shown in Fig. 1B measured as a function of 
temperature at zero magnetic field and in an 
applied DC magnetic field of 45 T, demonstrating 
that activated electrical conductivity character- 
istic of an energy gap ~40 K at the Fermi energy 
persists up to high magnetic fields. The non- 
magnetic ground state of SmB, is evidenced by 
the linear magnetization up to 60 T (Fig. 1B, bot- 
tom inset). 

We observed quantum oscillations in SmBg by 
measuring the magnetic torque. The measure- 
ments were done in magnetic fields up to 40 T 
and down to 7 = 0.4 K and in magnetic fields up 
to 35 T and down to T = 0.03 K. Quantum os- 
cillations periodic in inverse magnetic field are 
observed against a quadratic background, with 
frequencies ranging from 50 to 15,000 T (Fig. 1, 
A, C, and D). A Fourier transform of the quantum 
oscillations is shown in Fig. 2A as a function of 
inverse magnetic field, revealing well-defined peaks 
corresponding to multiple frequencies. The peri- 
odicity of the quantum oscillations in inverse 
magnetic field is revealed by the linear Landau 
index plot in Fig. 2B. 

The observation, especially of rapid quantum 
oscillations with frequencies higher than 10 kT 
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Fig. 1. Quantum oscillations in the magnetic torque in SmBg. (A) Quantum 
oscillations in magnetic torque are visible against a quadratic background. 
(Inset) Schematic of the magnetic torque measurement setup using a 
capacitive cantilever and the notation for angular rotation by angle 0. (B) Re- 
sistance as a function of temperature in zero magnetic field (blue line) and 
at 45 T (green line) using an unchanged measurement configuration on a 
SmBe sample of dimensions 1.1 by 0.3 by 0.1 mm. (Top inset) Measured 
resistance from 80 mK up to high temperatures, from which the high IRR 
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can be ascertained [a fit to activated electrical conductivity is provided in 
(23)]. (Bottom inset) Magnetization of SmBg at 2.1 K remains linear up to 60 T. 
(C) Dominant low-frequency quantum oscillations can be discerned after 
background subtraction of a sixth-order polynomial. (D) Magnetic torque 
at the highest measured fields after the subtraction of the low-frequency 
background torque. Quantum oscillations are visible in an intermediate- 
frequency range (between 2000 and 4000 T) as well as a high-frequency 
range up to 15,000 T. 
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(corresponding to approximately half the volume 
of the cubic Brillouin zone) in SmB,g, is striking. 
This observation is in contrast to previous re- 
ports of very-low-frequency quantum oscillations 
corresponding to a few percent of the Brillouin 
zone in SmBg, attributed to a two-dimensional 
surface contribution (22). Our observation of very 
high quantum oscillation frequencies requiring 
mean free paths on the order of a few micrometers 
would be challenging to explain from a surface 
layer of a few atomic lengths’ thickness, which 
would typically yield such rapid frequencies only 
at a special angle of inclination at which the cy- 
clotron orbit lies completely within the surface 
layer. Key to identifying the Fermi surface from 
which the observed quantum oscillation frequencies 
originate is a comparison with previous quantum 
oscillation measurements on metallic hexaborides, 
such as nonmagnetic LaBg, antiferromagnetic 
CeBg,, and antiferromagnetic PrBg (16, 17). These 
materials exhibit a metallic ground state involving 
predominantly conduction electrons, with a low 
residual resistivity of the order of one microhm cm 
(=10° times lower than in Kondo insulating SmB,) 
and are characterized by a multiply connected 
Fermi surface of prolate spheroids (Fig. 3, D and 
E). Strikingly, the angular dependence of the var- 
ious quantum oscillation frequencies in SmBg 
reveals characteristic signatures of the three- 
dimensional Fermi surface identified in the me- 
tallic rare earth hexaborides (Fig. 3, A to C). In 
particular, the high observed o. frequencies (Fig. 3A) 
reveal the characteristic symmetry of large pro- 
late spheroids centered at X-points of the Brillouin 
zone (Fig. 3, D and E), whereas the lower observed 
frequencies (Fig. 3A) reveal the characteristic 
symmetry of small ellipsoids located at the neck 
positions. Both of these types of ellipsoids are 
universal Fermi surface features identified from 
experiment and band structure calculations in 
the metallic rare earth hexaborides (J6, 17). Sim- 
ilar features are also revealed in density functional 
calculations of SmBg when the Fermi energy is 
shifted from its calculated position in the insu- 
lating gap either up into the conduction or down 
into the valence bands (Fig. 3, D and E) (23). 
The observed angular dependence of quantum 
oscillations in SmBg remains the same irrespec- 
tive of whether the sample is prepared as a thin 
plate with a large plane face perpendicular to the 
[110] direction, or to the [100] direction (fig. S1), 
and exhibits the same characteristic signatures 
with respect to the orientation of the magnetic 
field to the crystallographic symmetry axes of 
the bulk crystal (Fig. 3A). The bulk quantum 
oscillations we measure in SmBg corresponding 
to the three-dimensional Fermi surface mapped 
out in the metallic rare earth hexaborides may 
not be directly related to the potential topolog- 
ical character of SmBg, which would have as its 
signature a conducting surface (24). In addition 
to the magnetic torque signal from the atom- 
ically thin surface region being several orders 
of magnitude smaller than the signal from the 
bulk, the observation of surface quantum oscil- 
lations would be rendered more challenging by 
the reported Sm depletion and resulting recon- 


288 17 JULY 2015 + VOL 349 ISSUE 6245 


- al “42T<B<40T| B ol 
T = 0.35 K | 
0=87° | 
24- 
2 
a | 
i 0 t t + + + + + 
s ty) 400 800 1200 1600 2000 20- 
2 Bt 18T <B<40T we OF 
5 4 5| oO 
= E10) 
£ 5 
oO sg L 
= c 
Ss § qo 
c 0 f f | 
S 2000 4000 6000 8000 l 
3 — ores aL 
5 al 24T <B< 407; 
(o) 
2 L 
4 
F = 332T | 
4 4 { 0 L l 1 l it l 1 l 
10000 12000 14000 16000 0.00 0.02 0.04 0.06 0.08 
Frequency (T) 1/B (T") 


Fig. 2. Landau quantization in SmBg. (A) Fourier transforms of magnetic torque as a function of 
inverse magnetic field, from which a polynomial background has been subtracted, revealing multiple 
quantum oscillatory frequencies ranging from 50 T to 15,000 T. Field ranges for analysis have been 
chosen that best capture the observed oscillations, with the highest frequencies only appearing in 
the higher field ranges. (B) The maxima and minima in the derivative of magnetic torque with 
respect to the magnetic field, corresponding to the dominant low-frequency oscillation, are plotted 
as a function of inverse magnetic field; the linear dependence signals Landau quantization. 


struction of Sm ions at the surface layer of 
SmBg (25). 

The unconventional character of the state we 
measure in SmB, becomes apparent upon inves- 
tigating the temperature dependence of the quan- 
tum oscillation amplitude in SmBg. We found 
that between T = 25 K and 2 K, the quantum os- 
cillation amplitude exhibits a Lifshitz-Kosevich- 
like temperature dependence (Fig. 4) characteristic 
of a low effective mass similar to that of metallic 
LaBg, which has only conduction electrons (J6). 
The comparable size of low-temperature electronic 
heat capacity measured for our SmBg single crys- 
tals to that of metallic LaB, (23) also seems to 
suggest a large Fermi surface with low effective 
mass in SmBg. However, instead of saturating at 
lower temperatures as would be expected for the 
Lifshitz-Kosevich distribution characteristic of 
quasiparticles with Fermi-Dirac statistics (18), the 
quantum oscillation amplitude increases dramat- 
ically as low temperatures down to 30 mK are 
approached (Fig. 4). Such non-Lifshitz-Kosevich 
temperature dependence is remarkable, given the 
robust adherence to Lifshitz-Kosevich tempera- 
ture dependence in most examples of strongly 
correlated electron systems, from the underdoped 
cuprate superconductors (26) to heavy fermion 
systems (27, 28) to systems displaying signatures 


of quantum criticality (29), a notable exception 
being fractional quantum Hall systems (30, 3D. 
The possibility of a subtle departure from Lifshitz- 
Kosevich temperature dependence has been re- 
ported in a few materials (32, 33). 

The ground state of SmBg is fairly insensitive 
to applied magnetic fields, with activated electri- 
cal conductivity behavior across a gap remaining 
largely unchanged up to at least 45 T (Fig. 1B). 
Such a weak coupling to the magnetic field is in 
contrast to unconventional states in other mate- 
rials that are tuned by an applied magnetic field 
(6-11, 13). Furthermore, this rules out the pos- 
sibility of quantum oscillations in SmBg arising 
from a high magnetic field state in which the 
energy gap is closed. The possibility that quan- 
tum oscillations arise from static, spatially dis- 
connected metallic patches of at least 1-m length 
scale that do not contribute to the electrical trans- 
port also appears unlikely. Similar quantum os- 
cillations are observed in all (more than 10) 
measured high-quality samples in multiple high- 
magnetic-field experiments, with the best sam- 
ples yielding magnetic quantum oscillations of 
amplitude corresponding to a substantial frac- 
tion of the expected size from bulk SmBg. The 
presence of rare earths other than Sm has been 
ruled out to within 0.01% by means of chemical 
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Fig. 3. Angular dependence of the quantum oscillation frequencies in 
SmBg. (A) Data from two of the SmBg samples in which oscillations were 
observed are shown, indicated by solid and open circles. One of the samples 
(solid circles) was prepared as a thin plate with the dominant face per- 
pendicular to the [100] axis [sample 1 (23)], and the second sample (open 
circles) was prepared as a thin plate with the dominant face perpendicular to 
the [110] axis [sample 2 (23)]. (B and C) The angular dependence strongly 
resembles that of the three-dimensional Fermi surface in antiferromagnetic 
PrBe shown in (B), and nonmagnetic LaBe shown in (C) (17). (D and E) The a 
orbit in red in all the rare earth hexaborides is fit to large multiply connected 


[110] 


[001] (111] [110] 


prolate spheroids centered at the X points of the Brillouin zone, shown in (D); a 
cross section in the XM plane is shown in (E). The p and p’ orbits in each of the 
rare earth hexaborides are fit to small ellipsoids located at the neck positions 
[not shown in (D) and (E)]. More details of the fits are provided in (23). The 
remaining intermediate orbits are shown with lines as a guide to the eye. All 
orbit identifications have been made after measured frequencies and band 
structure calculations in PrBg and LaBg (17). (D) and (E) show Fermi surfaces 
calculated for SmBg using density functional theory (23), with a downward shift 
of the Fermi energy from its calculated position within the gap to expose the 
unhybridized bands and yield pocket sizes similar to experiment. 
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tion electrons form a solely conduction electron 
Fermi surface, similar to that we observe (2, 35-39). 
A fluctuation time scale in the range between 
10-8 and 10" seconds is suggested by previous 
x-ray absorption spectroscopy and Mossbauer 
measurements (40). This time scale is longer or 


analysis and scanning electron microscopy (23). 
Off-stoichiometric metallic regions of SmBg ap- 
pear an unlikely explanation for our results, 
given reports that up to 30% Sm depletion does 
not close the energy gap (20), whereas scanning 
electron microscopy of our samples reveals a 
homogeneity of within 1% of Sm concentration 
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over the sample area (23). The possibility of spa- 
tially disconnected strained regions of SmBg, 
which is known to become metallic under ap- 
plied pressures on the order of 10 GPa, or static 
spatially disconnected islands of hybridized and 
unhybridized Sm felectrons also seems unlike- 
ly. An improvement in the IRR by means of the 


comparable with the inverse cyclotron frequency 
(1/m.), Which is on the order of 10" seconds for 
the measured cyclotron orbits. Intriguingly, sim- 
ilar slow fluctuations have been invoked to explain 
quantum critical signatures in the metallic electron 
system B-YDAIB, (47). SmBg may be viewed as 
being on the border of quantum criticality in the 
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sense that it transforms from a nonmagnetic 
insulating phase to a magnetic metallic phase 
under applied pressures on the order of 10 GPa 
(42-45), which is in contrast to other metallic 
rare earth hexaborides in which the felectrons 
order magnetically in the ambient ground state. 
Our observation of a large three-dimensional 
conduction electron Fermi surface revealed by 
quantum oscillations may be related to reports 
of a residual density of states at the Fermi energy 
in SmB, through measurements of heat capacity 
(23, 46), optical conductivity (47), Raman scat- 
tering (48), and neutron scattering (49). Another 
possibility is that quantum oscillations could 
arise even in a system with a gap in the excitation 
spectrum at the Fermi energy, provided that the 
size of the gap is not much larger than the cy- 
clotron energy (50). Within this scenario, the re- 
sidual density of states observed at the Fermi 
energy with complementary measurements and 
the steep upturn in quantum oscillation ampli- 
tude we observe at low temperatures appear chal- 
lenging to explain. 
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NANOPARTICLE IMAGING 


3D structure of individual 
nanocrystals in solution 
by electron microscopy 


Jungwon Park,””’?* Hans Elmlund,*** Peter Ercius,°* Jong Min Yuk,”*? 
David T. Limmer,’° Qian Chen,”®" Kwanpyo Kim,” Sang Hoon Han,** 
David A. Weitz,” A. Zettl,”®° A. Paul Alivisatos’®°+ 


Knowledge about the synthesis, growth mechanisms, and physical properties of colloidal 
nanoparticles has been limited by technical impediments. We introduce a method for 
determining three-dimensional (3D) structures of individual nanoparticles in solution. We 
combine a graphene liquid cell, high-resolution transmission electron microscopy, a direct 
electron detector, and an algorithm for single-particle 3D reconstruction originally 
developed for analysis of biological molecules. This method yielded two 3D structures of 
individual platinum nanocrystals at near-atomic resolution. Because our method derives 
the 3D structure from images of individual nanoparticles rotating freely in solution, it 
enables the analysis of heterogeneous populations of potentially unordered nanoparticles 
that are synthesized in solution, thereby providing a means to understand the structure 


and stability of defects at the nanoscale. 


olloidal nanoparticles are clusters of hun- 
dreds to thousands of inorganic atoms typ- 
ically surrounded by organic ligands that 
stabilize them in solution. The atomic ar- 
rangement of colloidal nanoparticles deter- 
mines their chemical and physical properties, which 
are distinct from bulk materials and can be ex- 
ploited for many applications in biological imag- 
ing, renewable energy, catalysis, and more. The 3D 
atomic arrangement on the surface and in the core 
of a nanocrystal influences the electronic struc- 


ture, which affects how the nanocrystal functions 
in catalysis or how it interacts with other com- 
ponents at the atomic scale (7). Introduction of 
atomic dopants, surface adatoms, defects, and grain 
boundaries alters the chemical properties of nano- 
crystals (2). Ensembles of synthesized nanocrystals 
in solution are structurally inhomogeneous because 
of the stochastic nature of nanocrystal nucleation 
and growth (3, 4). Therefore, a method for deter- 
mination of the 3D atomic arrangement of indi- 
vidual unique nanoparticles in solution is needed. 
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Electron tomography is routinely used for 3D 
analysis of materials (5-9). This method cannot 
be applied to individual particles in a liquid be- 
cause it relies on acquisition of images of a single 
object at many different tilt angles over a period 
of 2 to 5 hours, assuming the object is static dur- 
ing the entire acquisition. Single-particle cryo- 
electron microscopy (cryo-EM) is a common 
method for the determination of 3D structures 
in biological sciences. The average 3D Coulomb 
potential map (i.e., density) of a protein is recon- 
structed from tens of thousands of transmission 
electron microscopy (TEM) images of randomly 
oriented copies of the same protein embedded in 
vitreous ice (10). The unknown 3D projection an- 
gles of the images are determined by computa- 
tional methods (77). Single-particle cryo-EM has 
succeeded in reconstructing biological molecules 
with nearly 3 A resolution (JO, 12). A similar ap- 
proach was recently applied to reconstruct the 
atomic structure of homogeneous ultrasmall gold 
clusters (73). However, the single-particle method 
is not readily applicable to 3D reconstruction of 
colloidal nanoparticles because of their intrinsic 
structural inhomogeneity at the atomic level. 

TEM has undergone technical improvements 
in the past decades (5, 14-17). The image reso- 
lution has been improved with the introduction 
of electron lens aberration correctors (15). The 
development of direct electron detectors has led 
to improvements in image quality and rapid 
acquisition of movies that allow for compensa- 
tion for beam-induced specimen motion, thereby 
providing a substantial enhancement to single- 
particle cryo-EM (18). 

We present a hybrid method for reconstruct- 
ing the 3D structures of individual nanoparticles 
in solution. The method represents a combina- 
tion of three technological advancements from 
TEM imaging in biological and materials sci- 
ence: (i) the development of graphene as a cover- 
ing to hold a liquid in vacuum (the so-called 
graphene liquid cell, or GLC) that allows atomic- 
resolution imaging of nanoparticles that move 
and rotate freely in solution by aberration-corrected 
TEM (3, 19); Gi) the advent of direct electron 
detectors, producing movies with millisecond 
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frame-to-frame time resolution of the rotating 
nanocrystals (78); and (iii) a theory for ab initio 
single-particle 3D reconstruction, used to solve 
the inverse problem of recovering the unknown 
3D orientations of the individual noisy nano- 
crystal projections (ZZ). The resulting hybrid tech- 
nique, 3D structure identification of nanoparticles 
by GLC EM (abbreviated as SINGLE), was used 
to separately reconstruct the 3D structures of two 
individual Pt nanocrystals in solution. 

Pt nanoparticles were chosen because of their 
high electron scattering strength, because their 
detailed atomic structure is important for catal- 
ysis, and because earlier graphene liquid cell 
studies have shown that they grow by nano- 
particle aggregation, resulting in complex struc- 
tures that are not possible to determine by any 
previously developed method. Pt nanocrystals 
with sub-2 nm diameter were prepared in solu- 
tion. Two graphene sheets were grown by the 
chemical vapor deposition method and used to 
entrap solvated nanocrystals (3). The graphene 
provides an ultrathin covering of material to 
maintain liquid conditions in the TEM vacuum 
and presents an inert surface onto which the 
nanoparticles do not adsorb. The translational 
and rotational motions of the particles in liq- 
uid pockets with sub-50 nm diameter were 
imaged in situ using TEAM I, a TEM instrument 


Fig. 1. Schematic illus- 
tration of in situ TEM 
imaging of Pt nanocrystals 
freely rotating ina 
graphene liquid cell 
(GLC) and 3D EM 
density maps calculated 
from individual Pt nano- 
particles in solution. 

(A) A movie of the single 
rotating Pt nanocrystal pro- 
vides 2D projected TEM still 
snapshots in many orienta- 
tions for ab initio particle 
reconstruction. (B) EM den- 
sity map obtained from the 
3D reconstruction of parti- 
cle 1. The orientation of the 
particle is aligned to expose 
{111} planes of the core 
domain. Three distinct 
crystal domains can be 
identified. (©) 3D EM den- 
sity map of particle 1 with 
alternative viewing angles. 
(D) EM density map 
obtained from the 3D 
reconstruction of particle 2. 
(E) 3D EM density map of 
particle 2 with alternative 
viewing angles. Each panel in 
(C) and (E) presents the two 
particles with the same angle 
and direction with respect to 
the orientations in (B) and 
(D). Scale bars, 0.5 nm. 


with geometrical and chromatic aberration cor- 
rection, operated at 300 kV using a direct elec- 
tron detector (Fig. 1A). The corrector was set to 
apply a slightly negative spherical aberration coef- 
ficient of about -10 um. Under these imaging con- 
ditions, we produced white atom contrast when 
using a small underfocus value of 30 to 50 A. 
Movies of the moving particles had a temporal 
resolution of 50 frames/s, a field of view of 1024 x 
1024 pixels, and a Nyquist sampling limit of 0.56 A. 
Each movie frame represents a 2D projection of 
many particles in random orientations. To re- 
construct the individual 3D structures, we chose 
a small region around each single particle of in- 
terest in each individual frame to create a set of 
projections per particle. The 3D orientations of 
the resulting series of noisy 2D images of a 
single rotating particle were recovered compu- 
tationally using an ab initio 3D reconstruction 
algorithm adapted from one originally developed 
to recover orientations from cryo-EM images of 
many identical individual particles (11). 
Although the TEM movies contain many par- 
ticles, not all particles could be used for recon- 
struction because of overlap with other particles 
and insufficient rotation. Here, we present the 
two most reliable 3D reconstructions from a 
1561-image series (particle 1 in Fig. 1, B and C, 
and movie S1) and a 1171-image series (particle 2 
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in Fig. 1, D and E, and movie S2). Shown in Fig. 1, 
B to E, are the EM density maps of the two Pt 
nanocrystals. The rendered particle volumes are 
5300 A? (diameter 22 A) for particle 1 and 4800 A? 
(diameter 20 A) for particle 2. Along with the 
direct visualization of the spatial distribution of 


A 
Conical 
protrusion 
ZX Disc 
i | core 
X equatorial \\ 
particle 1 


Fig. 2. The underlying structural principle of the small Pt nanoparticles 
studied here. (A) Schematic illustration of the front view of particles 1 and 
2 shown in Fig. 1. Both particles are composed of a dense central disc of 
atomic planes (blue lines) with conical protrusions (orange lines) anchoring 
on each side of the disc. (B) Cross-sectional view of the EM density map of 
particle 1 and 2 along the vertical plane. (©) View orthogonal to (A), showing 
the overlaid lattices. Scale bars, 0.5 nm. (D) Schematic illustration of the top 
view of particles 1 and 2 shown in (C). (E) Cross-sectional view of the EM 
density map of particles 1 and 2 along the equatorial plane. 


Fig. 3. Cross-sectional study of particle 1. (A) 3D A 
density map of particle 1 with color coding to highlight 
the three sections. Cross sections 1 and 2 are in arbi- 
trary positions near crystal domain interfaces. Scale bar, 
0.5 nm. (B) Slab through the 3D reconstruction of par- 
ticle 1 along the vertical plane, with tentative atomic po- 
sitions indicated. ABC repeats of {111} planes are visible. 
(C) Slab along cross section 1. The exposed (100) sur- 
face of the fcc Pt crystal is shown consistently in both 
exposed surfaces. Intersections with {111} planes in the 
top domain are displayed by red dashed lines. (D) Slab 
along the equatorial plane of (A) exposes a (110) plane. 
The intersections with {111} planes are shown by blue 
dashed lines. Pseudo-atomic structure (right) demon- B 
strates the rotation angle (14°) between the (100) and 

(110) surfaces from the top and bottom domains, re- 

spectively. (E) Slab along cross section 2. The disordered 

(100) surface of the fcc Pt crystal is shown consistently 

in both exposed surfaces. Intersections with pseudo 

{111} planes in the bottom domain are displayed by red 


dashed lines. 
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Pt atomic planes, external and internal struc- 
tures of the particle are uncovered. Each recon- 
struction shows three distinct crystal domains in 
both of the Pt particles. In Fig. 1, B and D, we show 
views of the EM maps in an orientation that re- 
veals distinct lattice planes of the core domain. 


particle 2 


Cross- 
section 1 


section 2 


Differently oriented 3D density maps are shown 
in Fig. 1, C and E, and movies S1 and S2. 

Our 3D reconstruction methodology produced 
reconstructions at near-atomic resolution from 
relatively small sets of noisy experimental TEM 
images of nanocrystals in random orientations. 
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We tested the validity of SINGLE using 1000 
multislice TEM simulations of a randomly ori- 
ented Pt nanocrystal with dodecahedral sym- 
metry and corresponding size. Reconstructions 
were obtained from images with a signal-to-noise 
ratio (SNR) roughly corresponding to that of the 
experimental images (see fig. S1). This verified 
that the experimental images can conform to the 
projection-slice theorem with ideal microscope 
conditions (20). 

To confirm the existence of a projected lattice 
in the experimental images, we calculated their 
power spectra, which showed distinct spots along 
different crystalline zone axes (fig. S2). Closer ex- 
amination of the power spectra and correspond- 
ing orientation coverage throughout the image 
series revealed that each particle continuously 
undergoes small local rotations followed by rapid 
orientation changes, often accompanied by later- 
al movement. The noise variance in the individ- 
ual frames of the movie exceeds the signal variance 
by approximately a factor of 5, despite the strong 
scattering from the heavy Pt atoms (movies S3 
and S4 show 200 raw TEM images). Although 
sufficiently thin to observe atomic detail, the liq- 
uid between the graphene sheets introduces a 


granular background, making it difficult to dis- 
tinguish the facets of the nanocrystal in the indi- 
vidual frames. To enhance the image contrast 
and allow accurate 3D orientation determina- 
tion, we averaged the image series in sets of five 
consecutive frames, resulting in a time resolution 
of ~100 ms. After frame averaging, we carefully 
scanned the image series to remove a small frac- 
tion (~20%) of those averages that did not show 
any lattice contrast because the particles had 
moved out of the narrow ideal focal plane of the 
aberration-corrected microscope. 

Initial 3D models were obtained using our re- 
cently developed framework (PRIME) for ab initio 
single-particle 3D reconstruction (JZ). The stan- 
dard technique used in biological single-particle 
cryo-EM (JO) assigns each image the single best 
matching orientation, as determined by correla- 
tion matching to a gallery of reference images, 
obtained by projecting an a priori available 3D 
reference model. Two fundamental limitations 
of the standard technique are the bias introduced 
by the initial 3D model and the lack of mecha- 
nisms for modeling the alignment errors when 
data are noisy and the model is of poor quality. 
These limitations may be quite substantial when 
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Fig. 4. Twist grain boundary free energies as a function of misalignment angle for the (110)-(100) 
crystal planes of a nanocrystal with 1135 atoms. Shaded regions indicate error bars of 1 and 2 
standard deviations. The inset shows a relaxed nanocrystal with an initial misalignment angle of 14° The 
non-flat grain boundary is located in the center of the nanocrystal. Images around the perimeter show the 
two relaxed planes at the grain boundary from the 100 (red) and 110 (white) grains for different 


misalignment angles. 
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reconstructing nanocrystals, because every parti- 
cle is different and the particle population cannot 
be averaged, as in traditional single-particle EM. 
The individual frames also have a low SNR be- 
cause of the relatively low per-frame electron dose 
and because of the granular background intro- 
duced by the liquid. PRIME overcomes these 
drawbacks by using weighted orientation as- 
signment and stochastic optimization for de- 
termination of an optimal orientation weight 
distribution without any a priori information 
about the nanoparticles. 

To initialize the 3D reconstruction process, we 
assigned the random orientations to images, pro- 
ducing a featureless spherical density map. The 
random orientations were refined by stochastic 
optimization of the correlation between the im- 
ages and reprojections of the density map, using 
information from 30 A to 3 A and a discrete 
search space of orientations. The resulting initial 
model had a resolution of 2.5 A according to the 
0.143 Fourier shell correlation (FSC) criterion 
(21). We extended the PRIME algorithm by in- 
troducing a continuous orientation search space 
and used stochastic optimization to determine a 
continuous distribution of weights that related 
the continuous distribution of orientation param- 
eters to the 3D reconstruction. Each round of 
the PRIME iterative alignment procedure in- 
volved determination of orientation weights for 
all particle images, followed by a weighted 3D 
reconstruction by direct Fourier reconstruction 
using a Kaiser-Bessel interpolation kernel. A few 
hundred iterations were executed, and in every 
round the FSC was calculated and used to con- 
struct a 3D Wiener filter that filtered the map 
such that the optimal SNR was obtained at the 
present resolution (22). The resolution of the final 
refined maps was measured to 2.10 A (particle 1) 
and 2.14 A (particle 2), respectively. 

Even though the FSC methodology provides 
an accurate measure of the resolution when the 
reference structure used for matching has been 
appropriately low-pass filtered, it does not en- 
sure against grossly incorrect structures. To vali- 
date our structures, we determined the agreement 
between the individual images used for recon- 
struction and the corresponding reprojections of 
the reconstructed 3D map. All image reprojec- 
tion pairs (fig. S3) were generated for the two 
reconstructions, and the Fourier ring correlation 
(FRC) (23) was calculated between all pairs. The 
average FRC was larger than 0.143 to a resolution 
of 1 A and showed a distinct peak spanning the 
1 to 2 A resolution region (fig. $3). This peak is 
due to the correlation between atomic densities 
in the reprojections and atomic densities in the 
images. We concluded that our reconstructions 
showed excellent agreement with the images. 
The spatial resolution is higher for particle 1, 
which has a larger number of frames, indicating 
that the present resolution can be improved by 
acquisition of longer movies that cover a wider 
range of rotational orientations. 

We had anticipated that the Pt nanocrystals 
would have at least twofold rotational symmetry, 
perhaps even cubic symmetry. Remarkably, the 
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reconstructions appeared asymmetrical, and at- 
tempts to apply C2, C4, C5, and D2 point-group 
symmetry by aligning the images to the principal 
symmetry axis and doing symmetric orientation 
refinement failed to improve the correlation be- 
tween the reprojections and the images. This 
suggests that the Pt nanocrystals do not possess 
icosahedral, cubic, or pentagonal symmetry but 
are intrinsically asymmetrical. Reconstruction 
by the traditional single-particle cryo-EM techni- 
que, using an initial model based on a faceted 
face-centered cubic (fec) nanoparticle, would suf- 
fer severe initial model bias and would not repro- 
duce the true asymmetrical multidomain structure. 

Our reconstructions of particles 1 and 2 re- 
vealed different asymmetrical crystal structures 
with the same underlying principle: a dense cen- 
tral disc of atomic planes (the core) with conical 
extensions anchored on each side, protruding in 
opposing and orthogonal directions with respect 
to the equatorial plane (see schematics in Fig. 2A 
with the lattice in different crystal domains col- 
ored blue and orange). Despite this similarity, 
the reconstructed 3D structures of the two Pt 
particles show interesting differences. First, the 
atomic arrangements on the surfaces observed in 
the EM density maps of particle 1 and 2 are dis- 
tinct (Fig. 1C and Fig. 2E). Second, the two 
particles show different degrees of crystallinity 
in each domain. Whereas straight {111} crystal 
planes in three domains of particle 1 are shown 
throughout the cross-sectional images along the 
mid-vertical plane (Fig. 2B) and vertical planes at 
different depths (fig. S4), particle 2 shows more 
disordered internal structures within the do- 
mains. Also in particle 1, the conical protru- 
sions have {111} atomic planes tilted with respect 
to the {111} atomic planes in the core. Particle 2 
exhibits protrusions with a larger degree of dis- 
order relative to the well-aligned {111} lattices 
in the particle 1 protrusions. Views of the recon- 
structions perpendicular to the equatorial plane 
(Fig. 2C) and a schematic illustration (Fig. 2D) 
show the overlaid lattices of the core (blue lines) 
and protrusions (yellow lines) with different tilt- 
ing angles for particle 1 (14°) and particle 2 (7°). 

The cross-sectional views along the equatorial 
plane (Fig. 2E) for particles 1 and 2 show similar 
arrangements of the {111} lattice planes, but cross- 
sectional images along the horizontal planes be- 
low and above the equatorial plane for the two 
particles (particle 1 in Fig. 3, C to E; particle 2 in 
fig. S5) indicate that the two particles are as- 
sembled by multiple domains but in unique geom- 
etries. The multiple domains and twisted grain 
boundaries that are present in the reconstruc- 
tions are similar to previous observations of Pt 
nanocrystal growth trajectories, where small par- 
ticles were observed to join along surfaces with 
low ligand coverage (3). The multidomain arrange- 
ment is also supported by other tomography 
reconstructions of larger Pt nanocrystals contain- 
ing multiply twinned domains with decahedral 
symmetry and central screw dislocations (3, 5). 
Multidomain structures are common in many 
other colloidal metal nanoparticles, which likely 
evolve as a result of multiple coalescence events 
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during growth (3, 24-26). Our observation of het- 
erogeneously structured Pt nanoparticles formed 
in the same solution confirms that individual 
particles from the same synthesis follow different 
nucleation and growth trajectories. 

Figure 3A presents a 3D density map with col- 
ored sections indicating the upper (blue), core 
(gray), and lower (purple) domains of particle 1. 
Figure 3B shows a cross section perpendicular to 
the equatorial plane with the tentative atomic 
positions (different colored dots in each domain) 
indicating {111} lattice planes of fec crystal that 
have repetition in every three crystal planes in all 
three crystal domains. Atoms closer to the sur- 
face seem to deviate farther from a perfect fcc 
structure. This is presumably explained by the 
fact that surface atoms are prone to relax excess 
free energy because of insufficient coordination 
and stabilization by surface ligand binding. In 
addition, the interface between the domains is 
disordered and not flat—a general consequence 
of the reduction in surface energies expected for 
nanoscale crystals relative to their bulk counter- 
parts. The locations of three cross sections along 
horizontal planes (cross section 1, equator, and 
cross section 2) are indicated in Fig. 3A. Cross 
sections 1 and 2 are positioned near the interface 
between the central disk and the upper and low- 
er conical protrusions, respectively. Cross section 
1 exposes two facing surfaces from the upper and 
core domains (blue and gray densities in Fig. 3C, 
respectively). Cross section 1 shows a (100) surface 
with red dashed lines that trace the {111} planes 
exposed on the (100) surface. The cross section at 
the equator (Fig. 3D) exhibits a (110) surface, and 
blue dashed lines indicate the {111} planes ex- 
posed to the (110) surface. The red and blue lines 
are mapped onto the pseudo-atomic illustra- 
tion of (100) and (110) surfaces with the proper 
orientation (a 14° rotation angle) in the right 
image of Fig. 3D. At cross section 2, near the 
interface between the central disk and the lower 
conical protrusion, a surface structure with pseu- 
do (100) patterns traced by red dotted lines is 
exposed from the core and lower domains (gray 
and purple densities in Fig. 3E, respectively), 
which deviate from the ideal fcc structure. Figure 
3 shows that the conical protrusions and the core 
join along (100) and (110) surfaces with distor- 
tions at the interface. Multiple domains merging 
along the low-index crystal planes, such as (100) 
and (110), are presumably formed by coalescence 
events between small particles during the particle 
growth as a route to minimize excess surface en- 
ergy. We previously observed a similar scenario: 
Two small Pt particles join along the (111) surfaces 
during growth (3). Presumably, coalescence along 
the low-index surfaces and the consequent evolu- 
tion of the interface structure are mechanisms by 
which the nanocrystals reduce free energy. 

To examine whether there is a thermodynamic 
rationale for the multidomain structures that we 
observed, we computed the free energies for the 
grain boundary formation and for the ligand- 
exposed surfaces. This was done using the Frenkel- 
Ladd method (27) for an embedded atom model 
of Pt (28), using an appropriate thermodynamics 


integration path (fig. S6). We found that for low- 
angle misalignments, like those observed in par- 
ticle 1 [14° rotation angle between {111} planes 
exposed on (110) and (100) surfaces from the cen- 
tral disc and upper domain in Fig. 3], the grain 
boundary free energy for a nanoparticle with 
1200 atoms is 0.3 J/m?, increasing to 0.55 J/m? 
for misalignment angles of up to 59° (Fig. 4). 
Small rotational relaxation of the crystal grains is 
found to be energetically viable at these inter- 
faces, which may explain the orientation of the 
side protrusions in particle 1 with respect to its 
central section. 

The surface free energy for the ligand-exposed 
interfaces was computed to be 2.8 J/m? as aver- 
aged over the (100) and (111) surfaces, weighted 
appropriately for the cuboctahedral shape. The 
disparity of scales between the grain boundary 
free energies and the much larger exposed sur- 
face free energies confirms that there is a large 
thermodynamic driving force for coalescence, 
even when such events result in grain boundary 
formation. As the free energy gain upon coalescence 
is much larger than thermal energies, initial aggre- 
gations are likely irreversible. The resultant grain 
boundaries are then kinetically arrested over lab- 
oratory time scales (see supplementary materials). 

Our results show that the SINGLE methodol- 
ogy can be used to investigate the structural 
principles underlying the assembly and transient 
morphology of any stable, small nanoparticle in 
solution. We envision that SINGLE can be ap- 
plied directly to in situ 3D structural studies of 
many other kinds of solvated particles. 
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ANIMAL PHYSIOLOGY 


Summer declines in activity and body 
temperature offer polar bears 
limited energy savings 


J. P. Whiteman,’”* H. J. Harlow,” G. M. Durner,® R. Anderson-Sprecher,* 
S. E. Albeke,” E. V. Regehr,® S. C. Amstrup,’ M. Ben-David?” 


Polar bears (Ursus maritimus) summer on the sea ice or, where it melts, on shore. Although 
the physiology of “ice” bears in summer is unknown, “shore” bears purportedly 

minimize energy losses by entering a hibernation-like state when deprived of food. Such a 
strategy could partially compensate for the loss of on-ice foraging opportunities caused 
by climate change. However, here we report gradual, moderate declines in activity and body 
temperature of both shore and ice bears in summer, resembling energy expenditures 
typical of fasting, nonhibernating mammals. Also, we found that to avoid unsustainable 
heat loss while swimming, bears employed unusual heterothermy of the body core. Thus, 
although well adapted to seasonal ice melt, polar bears appear susceptible to deleterious 
declines in body condition during the lengthening period of summer food deprivation. 


he current rate of Arctic sea-ice loss, un- 

precedented in at least the past several 

thousand years, is outpacing predictions 

and accelerating (7). This raises concerns 

about the persistence of polar bears (Ursus 
maritimus) (2), which hunt on the surface of 
the sea ice, most successfully between April and 
July (3), when ringed seals (Pusa hispida) use 
this substrate for rearing pups and molting (4). 
Between August and October, hunting can be 
poor (5) as seals reduce ice surface time (4). Ad- 
ditionally, in about two-thirds of the polar bear 
range (6), seals become largely pelagic as ice 
retreats from the continental shelf (7, 8). Some 
polar bears spend this period on shore, where 
foraging is also usually limited (9). 

To reduce the loss of body condition during 
summer food deprivation, shore bears purport- 
edly enter a state of lowered activity and resting 
metabolic rate similar to winter hibernation but 
without denning (10). This “walking hiberna- 
tion” could partially compensate for the nega- 


tive impacts of extended ice melt (11). However, 
in western Hudson Bay, Canada, shore bears lose 
body mass at a rate indicative of typical, rather 
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than hibernation-like, metabolism (12). The phys- 
iological state of bears that follow the retreating 
sea ice into the central Arctic basin in summer 
is unknown. In addition, recent sea-ice loss may 
be increasing the frequency of long-distance 
swims by polar bears (13), during which they 
risk losing over 10 times more heat than they 
produce (supplementary text) because their fur 
loses 90% of its insulation value when wet (14), 
and their subcutaneous fat does not provide 
blubber-like insulation (75). 

To understand polar bear responses to these 
challenges of summer ice melt, we investigated 
activity on shore (2008 and 2009) and on ice 
(2009) in the Beaufort Sea (Fig. 1) by affixing 
telemetry transmitters and activity loggers (16) 
to 25 females (mean age = 10 years + 1 SE, age 
range = 4 to 20 years) and one male (age 3). We 
recorded temperatures of the body core (an 
index of metabolic rate) (17) and periphery by 
implanting loggers into the abdomens (core) of 
10 bears (nine females, mean age = 11 years + 2 SE, 
age range = 3 to 23 years; one male, age 6) and 
the rumps (periphery) of seven other individ- 
uals (six females, mean age = 9 years + 2 SE, age 
range = 5 to 20 years; one male, age 2). 
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Fig. 1. The western Arctic on (A) 11 May 2009 and (B) 31 August 2009. Locations are shown for ice 


polar bears (blue circles), shore polar bears (red squares), and whale carcasses (triangles). The 300-m 
depth contour is shown as a dashed line. Sea ice is shown in white. 
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Bears on shore and ice exhibited similar ac- 
tivity patterns (Fig. 2A and tables S1 and S2; in 
table S1, shore data from 2008 and 2009 were 
pooled for lack of difference). Their time spent 
active peaked at ~25% between May and July, 
then fell to 12 to 22% between August and 
October (Fig. 2A), remaining greater than the 
values of 1 to 2% previously observed during 
winter hibernation (/8). The maximum activity 
level we measured (~25%) is approximately 
half that observed in all other bear species (19), 
perhaps reflecting polar bears’ specialization 
for hunting large vertebrate prey (3). The high 
energy costs of finding and subduing such prey 
can be reduced by ambush tactics (20), such as 
still-hunting at seal breathing holes (3). 

Although bears on shore and ice were simi- 
larly active between August and October, move- 
ment rates were higher for ice bears (Fig. 2B 
and table S1), potentially because the sea ice 
underfoot was drifting at 0.30 to 0.60 m/s (27). The 
proportion of rapid movement rates (>0.33 m/s) 
recorded when ice bears were resting (i.e., mo- 
tionless for >98% of the previous half hour) (6) 
increased in late summer (Fig. 2C), suggesting 
that as bears reduced their activity, their move- 
ment rates increasingly reflected ice drift. 

We did not find support for the hypothesis 
that lowered activity of shore and ice bears is a 
response to decreased food availability. For ice 
bears, location relative to primary seal habitat 
over the continental shelf (7, 8) was a poor pre- 
dictor of activity (mean model coefficients over- 
lapped with zero; n = 23 bears) (table S3). This 
result suggests that additional factors deter- 
mine seal distribution (e.g., fine-scale variation 
in productivity) (8) or their availability as prey 
(e.g., time spent on the ice surface) (4). Similar- 
ly, access to concentrated food resources for 
shore bears [i.e., locations within 500 m of bow- 
head whale (Balaena mysticetus) carcasses from 
Inuit subsistence harvest (table S4)] was not 
associated with activity levels (n = 7 bears) 
(table S3). 

Bears may reduce activity to avoid heat stress 
in summer, because their large body size and 
low ratio of surface area to volume hinder heat 
dissipation. In a previous study, captive polar 
bears became hyperthermic while walking on 
treadmills at speeds =1.6 m/s when air temper- 
ature was =>-5°C (22). However, air temperature 
in our study (daily means: -14.8° to 15.1°C) was 
unrelated to activity of shore bears (n = 12 bears) 
(table S3). Also, only 31 of our 61,882 measure- 
ments of movement rate (shore and ice com- 
bined; 2 = 25 bears) were =1.6 m/s, indicating 
that free-ranging bears seldom walk that rapid- 
ly. Thus, locomotion-induced heat stress is prob- 
ably rare and insufficient to explain the reduction 
in summer activity. 

Core body temperatures (monthly means of 
smoothed data) (16) did not differ between shore 
and ice bears during summer (Fig. 2D, table S1, 
and fig. S1), suggesting that all bears had similar 
nonhibernating, resting metabolic rates (fig. S2A). 
Mean core temperatures of ice bears gradually 
declined from May (37.3°C) to September (36.6°C) 
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(Fig. 2D). It is unclear whether a similar trend 
occurred in shore bears, because they were im- 
planted with loggers in August. 

These gradual temperature declines corre- 
lated with activity (mean r = 0.31, n = 9 bears) 
(table S5) (16) and may have been associated 
with fasting (5). Fasting can cause progressive 
reduction in body temperature of ~1°C in mam- 
mals (23, 24), corresponding with up to ~20% 
decreases in whole-body metabolic rate (23-25). 
However, the reduction in mass-specific meta- 
bolic rate is smaller and sometimes nonexistent 
after the loss of metabolically active tissue is 
considered (24). Unfortunately, we were unable 
to assess mass changes because we captured 
bears in spring before they reached peak mass 
(table S6). Reduced insulation from thinning 
of fur and subcutaneous fat could also cause 
temperature declines, although warm summer 
conditions could counteract insulation loss. 
Hence, gradual declines in the summer core tem- 
perature of polar bears suggest reductions in 
energy expenditure typical of food-deprived mam- 
mals (24). 


Data from one pregnant female that retained 
her logger through January provide evidence that 
polar bears in maternal dens exhibit hibernation 
core temperatures in winter. In contrast to the 
gradual declines observed in summer, her core tem- 
perature abruptly fell to ~35°C after 28 November 
(Fig. 2D and fig. S2B), as would be expected after 
parturition (3, 26). Such low temperature presum- 
ably reflects a 50 to 80% reduction in metabolic 
rate during winter hibernation, similar to that 
seen in other ursids (fig. S2A) (26, 27). 

Core temperatures, like movement rates, in- 
dicated that polar bears did not experience heat 
stress in summer. When walking on a treadmill 
at =1.6 m/s, captive polar bears frequently ex- 
hibited temperatures >39.0°C, including uncon- 
trolled rises to >40.0°C, leading to the conclusion 
that they store excess heat and are inefficient 
walkers (22). However, only 18 of our 27,843 mea- 
surements (7 = 10 bears) were >39.0°C and none 
were >40.0°C. This suggests that polar bear heat 
storage and locomotion efficiency should be re- 
assessed and that polar bears thermoregulate 
effectively during summer. 
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We also observed brief bouts of cold core 
temperatures (<35.0°C, typically <12 hours) as 
low as 22.3°C. These temperature changes were 
too rapid (up to +5.0°C per hour) to represent 
fluctuations in whole-body metabolic rate. Such 
bouts occurred in five of five shore bears and one 
of four ice bears. Similar peripheral temperature 
changes were recorded by rump loggers in six 
shore bears and one ice bear, although maximum 
hourly swings were greater for the periphery 
(means: +11.8°C and -10.4°C) than for the core 
(means: +5.0°C and -5.0°C) (supplementary text). 

It is unlikely that cold bouts of peripheral and 
core temperatures reflected consumption of ice 
or lying on it, as some occurred during above- 
freezing air temperature when sea ice was ab- 
sent. Instead, cold bouts appeared to be caused 
by two modes of regional heterothermy: (i) cool- 


ing the body periphery during inactivity and (ii) 
cooling the periphery and part of the core while 
swimming (Fig. 3, A to D and fig. $3). Support- 
ing this distinction, peripheral temperatures fell 
below 35.0°C during both inactivity (49 of 800 mea- 
surements, n = 3 bears) and swimming (572 
of 741 measurements, n = 3 bears). In contrast, 
core temperatures fell below 35.0°C only during 
swimming (6 of 11 measurements, 7 = 1 bear) 
(supplementary text) and never during inactivity 
(O of 353 measurements, 7 = 1 bear). Further- 
more, cold peripheral temperatures (<35.0°C) 
were associated with lower activity and warmer 
collar temperatures (i.e., when the bear curled 
up, warming the collar sensor) than were cold 
core temperatures (Fig. 3, E and F). 

Regional heterothermy of the body core is 
unusual (77) and may minimize heat loss while 
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swimming. Immersed polar bears probably re- 
duce skin temperature to several degrees above 
the surrounding water (supplementary text) (5), 
similar to seals (28). Our data suggest that bears 
maintain an internal thermal gradient by tem- 
porarily cooling the outermost tissues of their 
core to form an insulating shell. A similar pro- 
cess occurs in diving king penguins (Aptenodytes 
patagonicus), another polar endotherm without 
blubber (29). Control of this process is probably 
active (e.g., via vasoconstriction), because ab- 
dominal loggers cooled more quickly than is 
possible passively, and temperatures were sub- 
sequently regulated (Fig. 4 and supplementary 
text). This regional heterothermy may represent 
an adaptation to long-distance swims (13), al- 
though its limits remain unknown. One of our 
bears survived a 9-day swim, but when recap- 
tured 7 weeks later, she had lost 22% of her 
body mass and her cub (30). 

Sea-ice loss (7) increasingly limits spring and 
summer hunting opportunities for polar bears 
in parts of their range (2). In the Beaufort Sea 
and elsewhere (2, 31), this has reduced the en- 
ergy stores available for bears during subse- 
quent food deprivation (2, 5). We found that 
both core temperature and activity remained 
above values observed during winter hiberna- 
tion. The gradual declines in core temperature 
during summer suggest a typical mammalian 
response to fasting, which offers limited to no 
energy savings based on mass-specific meta- 
bolic rates (24). Thus, our data indicate that 
bears cannot use a hibernation-like metabolism 
to meaningfully prolong their summer period 
of fasting and reliance on energy stores. In 
conjunction with theoretical models linking nor- 
mal metabolic rate to depletion of stored energy 
and mortality (32), our findings suggest that 
bears are unlikely to avoid deleterious declines 
in body condition, and ultimately survival, that 
are expected with continued ice loss and length- 
ening of the ice melt period (2). 
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Fig. 4. Cooling curves of a polar bear. Hourly ab- 
dominal (core) temperatures [06:00 (UTC —08:00) 
on 2 October to 18:00 on 3 October 2009] represent 
swimming based on sparse location data (solid cir- 
cles). Predicted intraperitoneal temperatures (open 
circles) represent cessation of heat production 
(death) at the asterisk and subsequent immer- 
sion in 4°C water (supplementary text). 
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Keeping cool: Enhanced optical 
reflection and radiative heat 
dissipation in Saharan silver ants 


Norman Nan Shi,’ Cheng-Chia Tsai,’ Fernando Camino,” Gary D. Bernard,” 


Nanfang Yu,’* Riidiger Wehner** 


Saharan silver ants, Cataglyphis bombycina, forage under extreme temperature conditions 
in the African desert. We show that the ants’ conspicuous silvery appearance is created 
by a dense array of triangular hairs with two thermoregulatory effects. They enhance not 
only the reflectivity of the ant’s body surface in the visible and near-infrared range of 

the spectrum, where solar radiation culminates, but also the emissivity of the ant in 

the mid-infrared. The latter effect enables the animals to efficiently dissipate heat back 
to the surroundings via blackbody radiation under full daylight conditions. This biological 
solution for a thermoregulatory problem may lead to the development of biomimetic 
coatings for passive radiative cooling of objects. 


he silver ants of the Sahara desert, 

Cataglyphis bombycina, inhabit one of 

the hottest terrestrial environments on 

Earth, where they occupy the ecological 

niche of a “thermophilic scavenger” (7). In 
wide-ranging foraging journeys, they search for 
corpses of insects and other arthropods that have 
succumbed to the thermally harsh desert condi- 
tions, which they themselves are able to with- 
stand more successfully. On hot summer days, 
they may reach maximal foraging activities when 
temperatures of the desert surface are as high as 
60° to 70°C and their body temperatures measured 
as “operative environmental temperatures” are in 
the range of 48° to 51°C (2, 3). In order to survive 
under these conditions, occasionally the ants 
must unload excess heat by pausing on top of 
stones or dry vegetation, where, because of the 
steep temperature gradient above the sand sur- 
face, they encounter considerably lower tem- 
peratures. Under the midday sun of a summer 
day, the ants may resort to this thermal respite 
(cooling off) up to 70% of their entire foraging 
time (3). In keeping their body temperature 
below their critical thermal maximum of 53.6°C 
(4), they need not only to reduce heat absorp- 
tion from the environment but also to be able to 
efficiently dissipate excess heat, so that they can 
minimize the amount of time spent in thermal 
refuges. 

As we showed, through a series of optical and 
thermodynamic measurements, full-wave simu- 
lations, and heat-transfer modeling, a dense array 
of triangularly shaped hairs, characteristic of 
Cataglyphis bombycina, enables the ants to main- 
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tain lower body temperatures by (i) reflecting a 
large portion of the solar radiation in the visible 
and near-infrared (NIR) range of the spectrum 
and (ii) radiating heat to the surrounding envi- 
ronment by enhancing the emissivity in the mid- 
infrared (MIR), where the blackbody radiation 
spectrum of the ant’s body culminates. The thermo- 
regulatory solutions that the silver ants have 
evolved to cope with thermally stressful condi- 
tions show that these animals are able to con- 
trol electromagnetic waves over an extremely 
broad range of the electromagnetic spectrum 
(from the visible to the MIR) and that different 
physical mechanisms are employed in different 
spectral ranges to realize an important biological 
function. 

Specimens of Cataglyphis bombycina collected 
in Tunisia (34°10'N, 08°18'E) were used for all of 
the optical and thermodynamic measurements. 
In these ants, the dorsal and lateral sides of the 
body have a silvery glare (Fig. 1A) and are cov- 
ered by dense and uniform arrays of hairs (Fig. 1B 
and fig. S4). As scanning electron microscopy 
(SEM) images show, the hairs, which gradually 
taper off at the tip, are locally aligned in the same 
direction (Fig. 1C). Their most remarkable struc- 
tural feature is the triangular cross-section char- 
acterized by two corrugated top facets and a flat 
bottom facet facing the ant’s body (Fig. 1, D and 
E). Cross-sectional views obtained by focused ion 
beam (FIB) milling show that the gap between 
the bottom hair facet and the cuticular surface 
also varies but is generally larger than a few 
hundred nanometers. 

Optical reflectivity measurements of ant speci- 
mens were obtained with two Fourier transform 
spectrometers, one collecting spectra in the visible 
and NIR (wavelengths from 0.45 to 1.7 um) and the 
other in the MIR (wavelengths from 2.5 to 16 tum). 
The visible and NIR measurements showed 
that hemispherical reflection [i.e., the sum of 
specular and diffuse reflection collected through 
an integrating sphere (2)] is substantially enhanced 
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in regions with intact hair coverage as compared to 
regions from which the hairs had been removed 
(Fig. 2A and fig. S1). The hair-covered region re- 
flects 67% of the incoming solar radiation rather 
than only 41%, as is the case after their removal. 
This enhancement is due to scattering within the 
triangular hairs (Mie scattering), where light gets 
trapped and then reradiates out in all directions 
(5-7). Individual hairs of given cross-sectional di- 
mensions generate enhanced reflection due to 
scattering at specific wavelengths where funda- 
mental and higher-order Mie resonance modes 
are supported (Fig. 2C and fig. S5). Due to the 
variation in cross-sectional areas, resonance peaks 
from individual hairs are averaged out, so that 
the hair cover effectively acts as a coating with 
enhanced broadband reflection. 

Because of the ellipsoidal shape of the ant’s 
body, about two-thirds of the dorsolateral surface 
is obliquely hit by solar radiation (fig. S4). This 
prompted us to examine the reflectivity as a 
function of the incidence angle of radiation, which 
was varied from 0° to 80°, with 0° representing 
the direction normal to the surface. As the results 
show, Mie scattering enhances reflectivity over all 


A 


angles when regions with intact hair cover are 
compared to those with hairs removed (Fig. 2B). 
With increasing angle of incidence, reflectivity 
enhancement becomes particularly strong at be- 
yond 30° This is the critical angle at which total 
internal reflection starts to occur at the bottom 
facets of the hairs (Fig. 2D, II). At angles ap- 
proaching 90% reflectivity drops off when total 
internal reflection at one of the top facets starts 
to direct more of the radiation toward the ant’s 
body (Fig. 2D, III). 

Next, we performed finite-difference time- 
domain (FDTD) simulations in order to demon- 
strate the functional significance of the triangular 
cross section of the hairs in enhancing reflectivity 
in the visible and NIR ranges (figs. S5 to S8). 
These simulations compared the reflective prop- 
erties of triangular and circular hairs of the same 
cross sectional area. Even though the enhance- 
ment of reflectivity at normal incidence is com- 
parable in both cases, triangular hairs produce 
an extra enhancement at higher angles of inci- 
dence (Fig. 2B). The reason is that although Mie 
scattering of similar strength occurs in both cir- 
cular and triangular hairs, in the latter the total 


internal reflection at the bottom facets of the 
hairs enhances reflectivity substantially further. 
The high reflectivity disappeared when the spec- 
imens were wetted by an ethanol-water solution 
(fig. S3), which removed the refractive index con- 
trast between air and hairs and thus destroyed 
the conditions required for Mie scattering and 
total internal reflection. 

Reflectivity measurements performed in the 
MIR range revealed a second important point 
in the silver ant thermotolerance story. When 
proceeding from lower to higher wavelengths, at 
about 8 um, the enhanced reflectivity of regions 
with hair cover as compared to those without 
hairs reverses to reduced reflectivity (Fig. 2E). 
As Kirchhoffs law of thermal radiation states, re- 
duced reflectivity corresponds to enhanced emis- 
sivity (8). At a body temperature of 50°C, which 
the silver ants may reach when foraging at peak 
activity times, the blackbody radiation of the 
ant’s surface would lie in the range of 6 to 16 um 
(peaked at ~9 um) and thus allow the silver ants 
to offload heat more efficiently through radiative 
heat transfer. The latter decreases the steady-state 
body temperature and reduces the respite time. 


Fig. 1. The bright glare of the silver ant and its structural basis. (A) Silver ant offloading heat on top of dry vegetation (18). (B) SEM frontal view of the head 
densely covered by hairs. (C) SEM image of the hairs gradually tapering off toward the tip. (D) Cross-sectional view of the hairs milled with FIB. (E) SEM image of 
two hairs with one flipped upside down to exhibit the flat bottom facet. 
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Fig. 2. Reflectivity of the silver ant’s body sur- 
face from the visible to the MIR range of the 
spectrum. (A) Hemispherical reflectivity measured 
in the visible and NIR. (B) Measurement and sim- 
ulation results showing visible and NIR reflectivity as 
a function of incidence angle. (C) Cross-sectional 
view of a two-dimensional distribution of a light fi 
(magnitude of electric field component of light, or 
JE]) around a triangular hair for three exemplary Mie 
resonances. (D) Schematic diagram showing the in- 
teraction between visible and NIR light and a hair at 
small (1), intermediate (II), and large (lll) incidence 
angles. The corrugated upper two facets may 
hance diffuse reflection in the ultraviolet and visi 
ranges. (E) Reflectivity measured in the MIR at normal 
incidence. (F) Simulated MIR reflectivity as a function 
of incidence angle. 


How does the hair cover with its enhanced 
reflectivity in the visible and NIR and its enhanced 
emissivity in the MIR affect the radiative heat 
transfer between the ant’s body and the environ- 
ment? To investigate this question, we performed 
thermodynamic experiments, which mimicked all 
radiative heat transfer effects in the silver ants’ 
natural foraging environment (fig. S2). To accom- 
plish this task, we used a high-power xenon lamp 
to simulate the solar spectral distribution at the 
desert sand surface (9) and a thermoelectrically 
cooled high-emissivity metal plate to simulate the 
clear sky with its low level of blackbody radia- 
tion (10). The ant specimens were suspended on 
thin threads to minimize thermal conduction. 
Thermodynamic experiments were conducted in 
vacuum to study thermal radiation, as well as 
in still air to study the interplay of thermal ra- 
diation and convection. Under both conditions, 
the specimens with their natural hair covers 
were able to maintain significantly lower steady- 
state body temperatures than the same speci- 
mens with the hairs removed (Fig. 3). 

The thermodynamic experiments conducted 
in vacuum comparing specimens before and af- 
ter hair removal further revealed that the hair 
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cover decreases the time constants of temperature 
change (Fig. 3, B and E). The shortened time 
constants indicate an increased rate of radiative 
heat transfer and are a direct confirmation of the 
effect of the hairs in enhancing the MIR emis- 
sivity. By using the time constants and the heat 
transfer model, we computed that the hair cover 
enhances emissivity by about 15% (table S1). This 
enhanced emissivity is due to the fact that at 
large MIR wavelengths (i.e., at wavelengths much 
larger than the dimensions of the cross sections 
of the hairs), the hair structure acts as a gradient 
refractive index layer (fig. S9) (11-13), which pro- 
vides the surface with broadband, broad-angle 
antireflective properties in the MIR (Fig. 2, Eand 
F). Because of the influence of convection, the 
time constants of temperature change decreased 
by a factor of about 3 when the specimens were 
brought from vacuum into air (Fig. 3). This in- 
dicates that radiative heat dissipation amounted 
to about one-half of convection and, therefore, 
still played a significant role in the presence of 
natural convection. 

Applying experimentally extracted parameters 
to the heat transfer model revealed that the en- 
hanced visible and NIR reflectivity and enhanced 


16 


Visible and near-infrared 


Simulations: Pe 
see Triangular hairs +" 
cores Circular hairs |.” 


Experiments: 
Hairs intact 
— Hairs removed 


) 20 40 60 
Incidence angle (deg) 


| ll 
Total internal reflection 


Incidence angle increases 


80 


D Ul 


Mid-infrared 


0.3 


Reflectivity 
° 
N 


Bald region 


° 
oy 


Hairy region 


0.0 


0 : 20 . 40 ’ 60 
Incidence angle (deg) 


80 


MIR emissivity make comparable contributions 
to reducing the steady-state temperature in the 
presence of natural convection (figs. S10 and S11). 
On hot summer days in the Sahara, the foraging 
activities of silver ants often occur under low wind 
or even still air conditions, when the ants have to 
rely on enhanced visible and NIR reflectivity and 
enhanced MIR emissivity equally heavily to re- 
duce their body temperature during the respite 
behavior. 

It is interesting to note that the hairs cover 
only the top and the sides of the ant’s body, where 
they are responsible for the effects described 
above. The absence of hairs on the bottom sur- 
face reduces the radiative energy transfer between 
the hot sand and the cooler ant body, so that the 
animals can reduce the absorption of blackbody 
radiation from the desert floor. 

In conclusion, Saharan silver ants are covered 
with a dense array of triangular hairs on the top 
and sides of their bodies. These silvery hairs 
protect the ants against getting overheated in at 
least three ways. First, as a result of Mie scat- 
tering and total internal reflection, the hairs 
enhance reflectivity in the visible and NIR, where 
solar radiation culminates. Second, in the MIR, 
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Fig. 3. Results of thermodynamic experiments. 
(A) Thermal camera images showing the head of 
an ant specimen at the thermal steady state un- 
der different conditions. Temporal temperature 
profiles measured for the head before and after 
hair removal in vacuum (B) and in still air (©) are 
shown. (D to F) Results obtained for the hind part 
(gaster) of an ant specimen. Insets in (B) and (E) 
are photos of specimens before and after hair 
removal. In the “hairs intact” pictures of head and 
gaster, because of the limited solid angle of illu- 
mination, the silvery glance is not shown all over 
the body surface portrayed in the figures. 
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relationship between productivity 
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The search for predictions of species diversity across environmental gradients has 
challenged ecologists for decades. The humped-back model (HBM) suggests that plant 
diversity peaks at intermediate productivity; at low productivity few species can tolerate 
the environmental stresses, and at high productivity a few highly competitive species 
dominate. Over time the HBM has become increasingly controversial, and recent studies 
claim to have refuted it. Here, by using data from coordinated surveys conducted 
throughout grasslands worldwide and comprising a wide range of site productivities, we 
provide evidence in support of the HBM pattern at both global and regional extents. The 
relationships described here provide a foundation for further research into the local, 
landscape, and historical factors that maintain biodiversity. 


espite a long history of research, the nature 

of basic patterns between environmental 
factors and biological diversity remain 
poorly defined. A notable example is the 
relationship between plant diversity and 
productivity, which has stimulated a long-running 
debate (/-6). A classic hypothesis, the humped- 
back model (HBM) (7), states that plant species 
richness peaks at intermediate productivity, tak- 
ing above-ground biomass as a proxy for annual 
net primary productivity (7-9). This diversity peak 
is driven by two opposing processes. In unproduc- 
tive ecosystems with low plant biomass, species 
richness is limited by abiotic stress, such as in- 
sufficient water and mineral nutrients, which 
few species are able to tolerate. In contrast, in the 
productive conditions that generate high plant 
biomass, competitive exclusion by a small num- 
ber of highly competitive species is hypothesized 
to constrain species richness (7-9). Other mech- 
anisms that may explain the unimodal relation- 
ship between species richness and biomass include 
disturbance (7, 10), evolutionary history and dis- 
persal limitation (7, 72), and the reduction of total 
plant density in productive communities (13). 
Since its initial proposal, a range of studies 
have both supported and rejected the HBM, and 
three separate meta-analyses reached different 
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conclusions (14-17). Although this inconsistency 
may indicate a lack of generality of the HBM, it 
may instead reflect a sensitivity to study meth- 
odology, including the plant community types 
considered, the taxonomic scope, the range of 
site productivities sampled, the spatial grain and 
extent of analyses (17, 18), and the particular 
measure of net primary productivity used (19). 
The questions therefore remain open as to what 
the form of the relationship between diversity 
and productivity is, and whether the HBM serves 
as a useful and general model for grassland eco- 
system theory and management. 

We quantified the form and the strength of the 
richness-productivity relationship by using glob- 
ally coordinated surveys (20), which yielded scale- 
standardized data and were distributed across 
30 sites in 19 countries and six continents (Fig. 1). 
Collectively, our samples spanned a broad range 
of biomass production (from 2 to 5711 gm) and 
grassland community types, including natural and 
managed (pastures and meadows) grasslands 
over a wide range of climatic zones (temperate, 
Mediterranean, and tropical), and altitudes (ow- 
land to alpine) (table S1). Our protocol involved 
sampling 64 1-m? quadrats within 8-m-by-8-m 
grids (18, 21). At each site, between 2 and 14 grids 
were sampled, thus resulting in 128 to 896 quad- 


rats per site. In each 1-m? quadrat, we identified 
and counted all plant species and harvested above- 
ground biomass and plant litter. Litter production 
is a function of annual net primary productivity 
in grasslands and can have profound effects on 
the structure and functioning of communities, 
from altering nutrient cycling to impeding vege- 
tative growth and seedling recruitment (22, 23), 
thereby also playing a major role in driving com- 
munity structure. Indeed, the HBM was origi- 
nally defined in terms of live biomass plus litter 
material (7, 8). Most of the sites in our survey 
were subject to some form of management, usually 
livestock grazing or mowing. In this respect, 
our sites are representative of most of the world’s 
grasslands. Our sampling was conducted at 
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least 3 months after the last grazing, mowing, or 
burning event and at the annual peak of live bio- 
mass, which, when coupled with litter, constitutes 
a reliable measure of annual net aboveground pro- 
duction in herbaceous plant communities (24). 

Our results strongly support the HBM of the 
plant richness-productivity relationship. By using 
a global-extent regression model (N = 9631 1-m 
quadrats) (27), we found that plant richness 
formed a unimodal relationship with productivity 
(Fig. 2A) that is characterized by a highly sig- 
nificant concave-down quadratic regression [neg- 
ative binomial generalized linear model (GLM); 
Table 1]. This relationship was not sensitive to 
the statistical model used; the hump-backed re- 
lationship was also evident when we used a neg- 
ative binomial generalized linear mixed model 
(GLMM) that accommodated the hierarchical 
structure of our sampling design (grids nested 
within sites; Table 1 and fig. S1). 

At a sampling grain of 1 m?, 19 of 28 site level 
analyses (68%) yielded significant concave-down 
relationships (table S2 and Fig. 2A). This con- 
trasts markedly with the results of Adler et al. (1), 
who found only 1 of their 48 within-site analyses 
to be significantly concave-down. We also found 
the form of the productivity-diversity relation- 
ship to be robust to sampling grain: by using 
grains of 1 m? up to 64 m?, each time main- 
taining a global extent, we consistently found a 
significant concave-down relationship, although 
the proportion of variation explained tended to 
decrease with increasing grain (fig. S2). 

The HBM predicts a boundary condition or 
upper limit to diversity that, in any given site, 


may not be realized for a variety of reasons (8). 
Consistent with this view, our global-extent as- 
sociation is characterized by a significant concave- 
down quantile regression (95th percentile) (Table 
1), below which considerable scatter exists (Fig. 
2A). This pattern was also insensitive to the 
statistical method used; a hierarchical Bayesian 
analysis that accommodated the nested sampling 
design and that enabled both the mean and the 
variance of species richness to be modeled more 
accurately against (log-transformed) biomass also 
revealed a significant 95th percentile quantile 
regression (fig. S3). Likewise, we found a signifi- 
cant, concave-down quantile regression (95th per- 
centile) between the maximum (quadrat-scale) 
richness found within a grid and the total biomass 
of the same quadrat (Table 1 and fig. S4). Each of 
these approaches to characterizing boundary con- 
ditions suggests the existence of a “forbidden 
space,” wherein high productivity precludes high 
local diversity. Furthermore, they suggest that 
extremely low-productivity sites rarely accommo- 
date high diversity. 

Why do our data show a hump-backed rela- 
tionship, whereas those of Adler et al. (1) and 
related studies (4, 6), do not? One possibility is 
that data limitations can thwart detection of the 
HBM (/8). For example, the data used by Adler 
et al. differed from ours in the following poten- 
tially important ways: (i) They exhibited a maxi- 
mum live biomass of only 1535 g* (ours was 3374.2), 
(ii) litter was not included within the calculation 
of biomass, and (iii) sample size was limited to 30 
quadrats per site (ours ranged from 128 to 894 
quadrats per site; table S1). We conducted a form 
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of sensitivity analysis in which we reran our sta- 
tistical analyses using random subsets of our 
data that were constrained to exhibit similar 
properties to those of the Adler et al. data set. 
Specifically, after limiting the overall data set to 
less than 1535 g and excluding litter, we ran- 
domly selected 30 quadrats per site 500 times, 
each time conducting the within-site regression 
analyses (NV = 30 for each of the 28 site-level 
GLMs conducted per subsampling iteration). For 
each iteration, we also calculated the average 
range of biomass spanned by the 28 site-level 
relationships. Across the 500 iterations (one 
example set of outcomes is shown in Fig. 2B), 
the average proportion of significant concave- 
down, within-site regressions was 0.31 + 0.003 
(SEM), significantly less than our observed pro- 
portion of 0.68 (fig. S5). Moreover, when signif- 
icant concave-down relationships were detected, 
they tended to span a broader range of biomass 
than the remaining forms (including nonsignifi- 
cant relationships). Specifically, in 458 of the 500 
iterations (92%), the mean biomass range of the 
concave-down regressions was larger than the 
mean of the remaining forms’ biomass ranges 
(binomial test: P < 2.2 x 107°). Last, the 48 
within-site analyses of Adler et al. spanned, on 
average, a live biomass range of 428.7 g° + 38.36 
(range of 89 to 1217 g°). This is (i) less than half 
of the average range encompassed by our 28 site- 
level analyses shown in Fig. 2A (mean = 1067.5 ¢? + 
140.63; range of 286 to 3256 °°) and (ii) almost 50% 
narrower than the smallest average biomass range 
encompassed by our 500 random subset analyses 
(627.4 g?) (fig. S6). Taken together, these findings 
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Fig. 1. Site locations. Locations of the geographic centroids of the 30 study sites, which include 151 sampling grids. Some points overlap and are therefore 
indistinguishable. Additional site details are provided in table S1. Map is displayed using the Robinson projection. 
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strongly suggest that we were able to detect 
more concave-down relationships because of the 
greater sample sizes and biomass ranges in our 
analysis. 

It has been suggested (2) that some previous 
studies, including Adler et al. (1), failed to support 
the HBM because they excluded litter. Although 
we do find a significant concave-down relation- 
ship at the global extent using only live biomass 
(Table 1), a comparison of models using biomass 
versus biomass and litter (both N = 9,631) shows 


A 


total biomass to provide a far better fit [residual 
deviance = 10,105 (live) versus 10,037 (total); Vuong 
2-Statistic for comparing non-nested models: -13.4; 
P < 0.001]. It has also been suggested that previ- 
ous surveys failed to adequately represent high- 
productivity communities. Indeed, our high-biomass 
quadrats (1011 samples were over 1000 g°, ~10% 
of the 9631 samples; maximum of 5711 g~°) con- 
tributed considerably to the right-hand part of 
the fitted humped-back regression. This could be 
a reason why the data set of Adler et al. (2) (in 


which only 0.5% of samples were over 1000 g” 
with a maximum of 1534 g°”) failed to support 
the HBM. Our results therefore show that a test 
of the HBM in herbaceous plant communities 
yields the expected pattern when it is robust and 
comprehensive, spans a wide range of biomass 
production (from 1 to at least 3000 dry g~ year“), 
and provides sufficient replication of quadrats 
along the productivity gradient. 

Competitive exclusion has been cited as the 
primary factor driving low species numbers at 
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Fig. 2. Biomass production as a function of species richness. (A) 
Biomass production-species richness relationships for 28 study sites. 
Solid black line indicates significant quantile regression (95th percentile) 
of overall relationship (quadratic coefficient P < 0.001; N = 9631 quadrats). 
Dashed black line, significant negative binomial GLM (quadratic coefficient 
P < 0.001; N = 9631). Colored lines indicate significant GLM regressions 
(Poisson or quasi-Poisson), with N ranging from 128 to 894 quadrats. 
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(Inset) The frequencies of each form of relationship observed across study 
regions. NS, not significant. (B) Same as (A) but the results are derived 
from the analysis of an example, random subsample of the complete data 
set that satisfies the following criteria: litter biomass excluded, quadrats with 
biomass >1534 g°* excluded, and including 30 (randomly selected) quadrats 
per site (total N = 840). These criteria match the characteristics of the data 
set used by Adler et al. (1). 


Table 1. Regression results. Results of regression analyses of the relationship between productivity and species richness, measured at a global extent and a 
sampling grain of 1-m* quadrat. Total biomass = live biomass + litter biomass. All linear and quadratic term coefficients were highly significant (P < 0.001). 


Productivity measure Type of regression Sample size 


Test of model fit 


Linear term 
coefficient + SEM 


Intercept 
estimate + SEM 


Quadratic term 
coefficient + SEM 


Total biomass negative binomial 9631 likelihood ratio 
-2.52 + 0.23 4.69 + 0.186 -1.04 + 0.037 
sae eae GL (logrlink function). quadrats. st ee ee eee ume 
Total biomass negative binomial 9631 
GLMM (log-link function) quadrats likelihood ratio 
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random effects: 151 grids stat. = 114.0 ae < a 
TES NR ERE Serre OTR Stites tec IN SILC amertee.< Or cICS Sumi ipe Weavcysrete ie ey ac or nme ue oN baa rey cemrr yin Covin Cc mane tne RMN cation yn a iE cee 
eee eee bees -12.9 +7159 45.6 + 5.833 =113 41178 


8 
(log-link function) 


304 = 17 JULY 2015 + VOL 349 ISSUE 6245 
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high plant biomass (7, 8, 25). However, in the 
case of nitrogen addition the negative relation- 
ship between productivity and species richness 
has been shown to diminish over time [(26), but 
see (27, 28)]. It may be that low species richness 
in high-productivity conditions arises in part be- 
cause most such habitats are anthropogenic, and 
there are few species in the local pool adapted to 
these conditions (11, 12). If so, it is possible that 
species will eventually immigrate from distant 
pools, so that the right-hand part of the hump 
will then flatten out. 

We have shown a global-scale concave-down 
unimodal relationship between biomass produc- 
tion and richness in herbaceous grassland com- 
munities. However, the original HBM (7) is vaguely 
articulated by the standards of modern ecological 
theory, and it is clear that more work is needed 
to determine the underlying causal mechanisms 
that drive the unimodal pattern (J, 6, 17, 18). We 
recognize that, in our study and many others, 
productivity accounts for a fairly low proportion 
of the overall variation in richness and that many 
other drivers of species richness exist (28-30). 
Accordingly, we echo the call of Adler et al. (1 for 
additional efforts to understand the multivariate 
drivers of species richness. 
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Reverse glacier motion during 
iceberg calving and the cause of 


glacial earthquakes 


T. Murray,’* M. Nettles,” N. Selmes,’ L. M. Cathles,” J. C. Burton,* T. D. James," 
S. Edwards,* I. Martin,° T. O’Farrell,® R. Aspey,° I. Rutt,’ T. Baugé” 


Nearly half of Greenland’s mass loss occurs through iceberg calving, but the physical mechanisms 
operating during calving are poorly known and in situ observations are sparse. We show that 
calving at Greenland’s Helheim Glacier causes a minutes-long reversal of the glacier’s horizontal 
flow and a downward deflection of its terminus. The reverse motion results from the horizontal 
force caused by iceberg capsize and acceleration away from the glacier front. The downward 
motion results from a hydrodynamic pressure drop behind the capsizing berg, which also causes 
an upward force on the solid Earth. These forces are the source of glacial earthquakes, globally 
detectable seismic events whose proper interpretation will allow remote sensing of calving 
processes occurring at increasing numbers of outlet glaciers in Greenland and Antarctica. 


ne-third to one-half of Greenland’s total 
mass loss occurs through iceberg calving 
at the margins of tidewater-terminating 
glaciers (J, 2). Recent rapid changes in glacier 
dynamics are associated with increased 
calving rates (3-5) and increased rates of glacial 
earthquakes (6). At large glaciers with near-grounded 
termini, calving typically occurs when buoyancy 
forces cause icebergs that are the full thickness of 
the glacier to capsize against the calving front (6-9). 
This type of calving is associated with glacial 
earthquakes (6, 7, 10), long-period seismic emis- 
sions of magnitude ~5 that are observed globally 
(11). These earthquakes have expanded north- 
ward and increased sevenfold in number during 


the past two decades (6, 12, 13), tracking changes 
in glacier dynamics, the retreat of glacier fronts, 
and increased mass loss (6, 14). Buoyancy-driven 
calving represents an increasingly important source 
of dynamic mass loss (6-8) as glacier fronts through- 
out Greenland have retreated to positions near 
their grounding lines (75). However, because of 
the difficulty of instrumenting the immediate near- 
terminus region of these highly active glaciers, few 
direct observations of the calving process are avail- 
able, limiting development of the deterministic 
calving models required for improved understand- 
ing of controls on dynamic ice-mass loss. Detailed 
knowledge of the glacial earthquake source would 
allow quantification of calving processes for a large 
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class of Greenland glaciers, as well as for glaciers in 
several regions of Antarctica (13). 

Agreement on the source mechanism of glacial 
earthquakes is limited. Analysis of long-period 
seismic data shows that a sub-horizontal force 
acts approximately perpendicular to the glacier 
calving front during the earthquakes (6, 13). The 
observed seismic signal is generated over a pe- 
riod of 1 min or more (6, 7/7, 16), much longer than 
the source duration for tectonic earthquakes of 
similar size (17). Some authors favor a model in 
which momentum transfer produces a force acting 
in the upglacier and then downglacier directions as 
a newly calved iceberg overturns, accelerates away 
from the calving front, and subsequently deceler- 
ates (6, 10, 13, 18). Others suggest that the seismic 
signal arises from the iceberg scraping along the 
calving front or fjord bottom (7) or colliding with 
the glacier terminus (J9). Hydrodynamic interac- 
tions with fjord water may be important (20) but 
are little explored. Analytical investigations admit 
more than one possible mechanism for the earth- 
quakes (1/8), and no persuasive explanation has 
been presented for the vertical component of the 
earthquake force. We combined geodetic, seismic, 
and laboratory data to identify the forces acting 
during calving at large glaciers and to document 
the source of the associated seismic signals. 

We recorded geodetic data at the calving margin 
of Helheim Glacier (Fig. 1) (9), a major outlet of 
the Greenland Ice Sheet, during 55 days in July- 
September 2013. A wireless network of on-ice 
Global Positioning System (GPS) sensors (27) cap- 
tured glacier motion with centimeter-level accu- 
racy at a high temporal sampling rate in positions 
very close to the calving front (22). Hourly images 
from two cameras located ~4 km down-fjord from 
and looking at the calving front were used in stereo 
configuration to obtain the three-dimensional 
geometry of the calving front and calved icebergs 
(8, 22). Data from the global seismographic net- 
work were analyzed for the same time period to 
identify glacial earthquakes (13, 23) and obtain 
source parameters (11), including the orientation 
of the force active during the earthquake and the 
amplitude and centroid time, , of a centroid- 
single-force (CSF) history of prescribed shape (22). 

The glacier retreated ~1.5 km in a series of 
calving events during the observing period. We 
identified 10 large calving events from the camera 
images. All coincided with glacial earthquakes; in 
two cases, two earthquakes occurred between sub- 
sequent images. During the earthquakes, the region 
near the calving front showed a dramatic reversal 
of flow, moving upglacier for several minutes 
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while simultaneously moving downward (Fig. 2 
and fig. S1). The horizontal and vertical motion 
then rebounded rapidly. 

Observations from a glacial earthquake occurr- 
ing on day of year (DOY) 206 at 03:13:47 UTC are 
shown in Fig. 2, A and C. Analysis of camera im- 
ages indicates ice loss of 0.461 + 0.009 km*? (Fig. 
1) at a location of ice thickness 0.79 km, yielding 
an iceberg volume of 0.36 km? with an aspect ratio 
of 0.23. The earthquake had a CSF amplitude of 
0.24 x 10“ kg-m, with the force oriented 64°W 
(Fig. 1) and 9° above horizontal. GPS sensor 
1 (Fig. 1) showed a pre-earthquake flow speed 
of 29 m/day. Immediately before the earthquake 
centroid time, the sensor reversed its direction and 
moved upglacier at ~40 m/day (displacement = 
9 cm) and downward (displacement = 10 cm). The 
reversed motion was sustained for ~200 s and was 
followed by a downglacier rebound at ~190 m/day 
(displacement = 20 cm) and upward movement 
(displacement = 16 cm) for ~90 s. Similar tem- 
porally coincident signals were detected by near- 
by sensors 6 and 15 (Fig. 1 and fig. S1). 

Glacier deflection for a calving event on DOY 
212 (Fig. 1) is shown in Fig. 2, B and D. We ob- 
served similar responses for all glacial earthquake- 
iceberg calving events during which GPS sensors 
recording data of adequate quality were located 
within 500 m of the calved block (a total of nine 
glacial earthquakes and eight image pairs). These 
events occurred on DOY 205, 206 (three events), 
207, 211, 212, and 226 and were detected by multi- 
ple GPS sensors (further examples in fig. S1). 

The earthquake centroid times occurred at or 
near the end of the glacier’s rapid rebound phase, 
such that the upglacier earthquake force aligned 
in time with the reverse motion of the glacier. 
The horizontal glacier deflection is consistent 
with a model in which the reaction force on the 
glacier caused by seaward acceleration of the 
newly calved iceberg compresses the glacier front 
elastically. The front then rebounds as the force 
decreases and reverses polarity during iceberg 
deceleration. The glacier front thus acts as a 
spring, compressing and re-extending in phase 
with the applied force, which is the horizontal 
component of the seismic source. 

The downward deflection of the glacier front 
occurred in a region where vertical motion of the 
GPS sensors at tidal frequencies showed that the 
glacier is ungrounded and seawater is present 
beneath it. Iceberg rotation is likely to cause a 
low-pressure zone in the opening cavity between 
the iceberg and the glacier front. This pressure 
decrease would lower the load on the bedrock, 
resulting in an upward force acting on the solid 
Earth, as observed in our seismic analysis. A pres- 
sure decrease near the calving front would apply a 
net downward force on the glacier terminus, low- 
ering the glacier surface in a manner similar to 
that occurring twice each day when the ocean 
tides draw down the water level. At sensors ex- 
periencing earthquake deflections, we observed 
tidal variations in the glacier’s vertical position of 
~0.1 m per 1 m of tidal amplitude. The calving- 
related deflection of the glacier surface was ~0.1 
to 0.16 m, suggesting a change in water pressure 


equivalent to a water-height change of ~1 to 1.6 m, 
or roughly 1 to 2 x 10* Pa. 

No observations of pressure or water-level var- 
jiations are available from the region in the fjord 
immediately in front of the glacier, where thick 
ice mélange (Fig. 1) prohibits instrumentation. 
However, results from analog laboratory experi- 
ments allowed us to evaluate our inferences (22). 
A model glacier “terminus” was secured at one 
end of a water-filled tank, and plastic “icebergs” 
made from low-density polyethylene were placed 
flush against the terminus and allowed to capsize 
spontaneously under the influence of gravita- 
tional and buoyancy forces (24) (Fig. 3). Sensors 
embedded in the model glacier terminus moni- 
tored pressure in the water column and the force 
exerted on the terminus during iceberg capsize. 

The measured force on the terminus as the 
icebergs began to capsize was oriented in the 
upglacier direction and slowly increased as the ice- 
bergs rotated. As the icebergs neared horizon- 
tal, the force decreased rapidly. Pressure at the 
terminus decreased as the icebergs rotated, in- 
creasing again as the icebergs neared horizontal. 
Once the icebergs lost contact with the terminus, 
the measured force and pressure began to oscil- 
late as a result of induced wave action in the tank. 

We scaled up the measured forces and pres- 
sures to match the dimensions of icebergs calved 
at Helheim Glacier (Fig. 3). The laboratory data 
scale by powers of the ratio of the iceberg height 
in the field to the iceberg height in the laboratory 
(20, 24). The scaled peak force agreed well with 
typical values inferred from earthquake analysis 
(~10" N). The scaled peak pressure drop (~5 x 
10* Pa) applied over an area corresponding to the 
iceberg’s map-view dimensions yielded an upward- 
directed force consistent with the seismically in- 
ferred vertical force component, such that the 
total force acting on the solid Earth was oriented 
~10° above horizontal. Computation and inver- 
sion of synthetic seismograms from the scaled 
force and pressure data confirmed the consistency 
of the laboratory model with real-world data. 

We used the scaled force and pressure to 
predict the deformation of the terminus region 
(22). The total force (Fit) per unit area (Ap) act- 
ing on the calving region produces a horizontal, 
linear deflection orthogonal to the calving front, 
such that F,o,/Ap = EAL/L, where E is the Young’s 
modulus of glacial ice. The value of L is chosen to 
provide the best match to the glacier position 
data. This length-scale probably represents the 
distance from the terminus to the grounding 
zone. We modeled the ungrounded section of the 
glacier as an elastic beam of length L loaded by 
the vertical force created by the pressure drop. 
The inferred distances L are a few kilometers, 
consistent with values estimated from GPS data. 

Glacier displacements predicted from the scaled 
laboratory data for iceberg dimensions corre- 
sponding to a calving event on DOY 206 (Fig. 1 and 
Fig. 2A) are shown in Fig. 3. Agreement with the 
observed glacier displacement was very good, 
particularly during the time over which the 
force acted in the upglacier direction (until the 
earthquake centroid time). After this time, the 
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laboratory-derived prediction was dominated by 
oscillations of the water column in the tank, which 
did not contain the thick layer of ice mélange 
present in Helheim Fjord that would be expected 
to dampen such high-frequency oscillations. 
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We conclude that as large icebergs rotate and 
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downglacier flow and temporarily reversing the 
motion of the glacier. Hydrodynamic interaction 
of the iceberg with the fjord water rapidly reduces 
pressure behind the rotating iceberg, resulting in 
an upward force on the solid Earth that is the 


Fig. 1. Helheim Glacier, position of sensors, and 
seismic force directions. The location of GPS sen- 
sors and icebergs calved at Helheim Glacier (HH) for 
glacial earthquake events at 03:13 UTC on DOY 206 
2013 and 19:21 UTC on DOY 212 2013 are sup- 
erimposed on a Landsat 7 image from DOY 167 2013. 
“Affected” sensors exhibit earthquake-related deflec- 
tions. Scan-line-corrector failure stripes have been 
removed for clarity. Glacier flow is from left to right; 
bright white mélange (a mix of iceberg fragments 
and sea ice) can be seen in front of the calving 
margin. Calving-front positions were obtained from 
photogrammetric digital elevation models derived 
from cameras. Positions are meters in Universal 
Transverse Mercator zone 24N. 


Fig. 2. Response of GPS sensors on glacier at 
the time of glacial earthquakes. (A) Sensor 1 at 
03:13 UTC on DOY 206 2013. (B) Sensor 9 at 19:21 
UTC on DOY 212 2013. Blue dots show detrended 
along-flow displacement; red dots show height. 
Shading shows 1o position errors. Horizontal dis- 
placement has trends from 30 to 10 min before t, 
removed (A = 28.9 m/day, B = 246 m/day). 
Height has mean removed. (C and D) Plan view of 
GPS traces shown in (A) and (B) during the 30 min 
surrounding t. (marked as 0). 
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Fig. 3. Scaled laboratory data from glacier “terminus” during “iceberg” capsize event, compared 
with field observations. (A) Horizontal displacement scaled from force (black line) compared with 
downflow GPS data (blue). (B) Vertical displacement scaled from pressure (black line) compared with 
vertical GPS data (red). Errors in laboratory data are standard deviation calculated from repeated 


capsize events. GPS data shown are as in Fig. 2A. Photographs show stages of capsize at times marked 
by dashed lines and (solid gray line) t.. The aspect ratio of the model iceberg is 0.22. 


Fig. 4. Cartoon of glacier 
terminus during calving event. 
Glacier deflection caused by a 
capsizing iceberg is shown 
relative to the initial glacier 
position (dotted line). Acceleration ice flow 
of the iceberg to the right exerts a 
force in the upglacier direction 
(left), leading to reverse motion of 
the GPS sensors (green star). 
Reduced pressure behind the 
iceberg (L) draws water from 
beneath the glacier and from the proglacial fjord, pulling the floating portion of the glacier downward and 
exerting an upward force on the solid Earth. 
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vertical force observed in the earthquake. The 
lowered water pressure draws down the ungrounded 
glacier margin, pulling the glacier surface downward 
during the earthquake. 

Our results document the forces active during 
an increasingly important class of calving events 
and definitively identify the processes that cause 
glacial earthquakes. This understanding of gla- 
cier calving and glacial earthquakes opens the 
potential for remote quantitative characteriza- 
tion of iceberg calving and calving rates, as well 
as improved models for ice-ocean interaction. 
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Morphinan biosynthesis in opium 
poppy requires a P450-oxidoreductase 


fusion protein 


Thilo Winzer,' Marcelo Kern,’ Andrew J. King,’ Tony R. Larson,’ 
Roxana I. Teodor,’ Samantha L. Donninger,’ Yi Li,’ Adam A. Dowle,” 
Jared Cartwright,” Rachel Bates,” David Ashford,” Jerry Thomas,” 
Carol Walker,® Tim A. Bowser,” Ian A. Graham™* 


Morphinan alkaloids from the opium poppy are used for pain relief. The direction of 
metabolites to morphinan biosynthesis requires isomerization of (S)- to (R)-reticuline. 
Characterization of high-reticuline poppy mutants revealed a genetic locus, designated 
STORR [(S)- to (R)-reticuline] that encodes both cytochrome P450 and oxidoreductase 
modules, the latter belonging to the aldo-keto reductase family. Metabolite analysis of 
mutant alleles and heterologous expression demonstrate that the P450 module is 
responsible for the conversion of (S)-reticuline to 1,2-dehydroreticuline, whereas the 
oxidoreductase module converts 1,2-dehydroreticuline to (R)-reticuline rather than 
functioning as a P450 redox partner. Proteomic analysis confirmed that these two 
modules are contained on a single polypeptide in vivo. This modular assembly implies 
a selection pressure favoring substrate channeling. The fusion protein STORR may 
enable microbial-based morphinan production. 


he naturally occurring opiates of the 

morphinan subclass of benzylisoquinoline 

alkaloids (BIAs) include morphine, codeine, 

and thebaine. Morphine and codeine can 

be directly used as analgesic painkillers, 
and thebaine is widely used as a feedstock for 
the synthesis of a number of semisynthetic opi- 
ates, including hydrocodone, hydromorphone, 
oxycodone, and oxymorphone, as well as the 
opioid antagonist naloxone. The discovery and 
isolation of morphine from the opium poppy 
(Papaver somniferum L.) by Friedrich Sertitrner 
in 1806 (J) are a milestone in the history of 
pharmacy. More than 200 years later, opiate 
alkaloid-based pharmaceutical formulations 
remain the most potent treatment for severe 
pain, with sales totaling $US11.6 billion in 
2013 (2). 

BIAs are found in a number of species in the 
Papaveraceae family, but morphine production 
has only been reported in the opium poppy and 
the closely related P. setigerum (3). The morphinan 
backbone contains five asymmetric carbon cen- 
ters. Total chemical synthesis, although possible 
(4), is not an economically viable means of pro- 
duction. Consequently, morphinan alkaloids are 
still exclusively sourced from the opium poppy 
plant. Much effort has gone into the elucidation 
of the morphinan branch of BIA metabolism 
over the past 25 years, resulting in the identi- 
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fication of genes for all but the gateway step in- 
volving the epimerization of (S)- to (R)-reticuline 
(3, 5-14). Thus, although it has been possible to 
produce both (S)- and racemic mixtures of (R,S)- 
reticuline and morphinans in metabolically en- 
gineered microbial systems (75-18), the clean 
enzymatic conversion of (S)- to (R)-reticuline 
remains the goal. 

(S)-reticuline is the central intermediate of 
BIA metabolism (fig. S1), and conversion to 
its R epimer is believed to be a two-step pro- 
cess (19, 20). The S epimer is first oxidized to 
the quaternary positively charged amine 1,2- 
dehydroreticuline, followed by reduction to 
(R)-reticuline (Fig. 1A). Activities for each step 
have been reported, but the identity of the cor- 
responding proteins has not been established 
(19, 20). We have combined a candidate gene 
approach with genetic analyses of F, popula- 
tions of P. somniferum segregating for muta- 
tions that are deficient in (S)- to (R)-reticuline 
conversion and discovered that a fusion pro- 
tein is responsible for sequentially catalyzing 
both steps of the epimerization. 

RNA interference (RNAi) knockdown of co- 
deinone reductase in the opium poppy was re- 
ported to cause accumulation of (S)-reticuline, 
which is eight steps upstream of the codeinone 
reductase substrate (27), a result the authors at- 
tributed to metabolite channeling. We considered 
an alternative hypothesis to be the off-target co- 
silencing of a closely related oxidoreductase in- 
volved in the conversion of (S)- to (R)-reticuline. 
Using the sequence of the RNAi silencing con- 
struct to query an in-house expressed sequence 
tag (EST) library from stem and capsule tissue of 
opium poppies (22), we identified a contiguous 
assembly comprising a cytochrome P450 mono- 


oxygenase that is 3’-linked to an oxidoreductase. 
Sequencing cDNA clones from opium poppy stems 
confirmed the in-frame fusion transcript (Fig. 1A 
and fig. S2). The P450 module was designated 
CYP82Y2. 

To investigate whether the corresponding 
gene is a candidate for one or both steps in the 
epimerization of (S)- to (R)-reticuline, we se- 
quenced corresponding cDNA clones from three 
independent mutants identified from an ethyl 
methanesulfonate-mutagenized population of a 
high-morphine cultivar, HM2 (23). All three mu- 
tants have lost the ability to produce morphinan 
alkaloids and instead accumulate high levels of 
(S)-reticuline as well as the (S)-reticuline-derived 
alkaloids laudanine and laudanosine (Fig. 1, B 
and D). We found that all three mutant lines 
carry mutations in the corresponding gene lo- 
cus (Fig. 1A), which we name STORR [(S)- to 
(R)-reticuline]. The storr-I allele carries a pre- 
mature stop codon corresponding to amino acid 
position W668 in the oxidoreductase module 
of the predicted fusion protein. storr-1 plants 
also contain low but significant levels of 1,2- 
dehydroreticuline (Fig. 1C), which suggests that 
the oxidoreductase module catalyzes the sec- 
ond step of the epimerization, the reduction of 
1,2-dehydroreticuline to (R)-reticuline. storr-2 
and storr-3 are both disrupted in the CYP82Y2 
module: storr-2 contains a premature stop at 
codon position W278, and storr-3 contains a 
missense mutation causing a glycine-to-arginine 
substitution at position 550 (Fig. 1A). Dried 
capsules of storr-2 and storr-3 accumulate (S)- 
reticuline but not 1,2-dehydroreticuline, sug- 
gesting that the CYP82Y2 module is responsible 
for the first epimerization step, the oxidation 
of (S)-reticuline to 1,2-dehydroreticuline. Com- 
plementation tests and F, segregation analysis 
confirmed that a single genetic locus is respon- 
sible for the high-reticuline phenotype, associ- 
ation of the three recessive storr alleles with 
the high-reticuline phenotype, and the consec- 
utive roles of the CYP82Y2 and oxidoreductase 
modules in the epimerization of (S)- to (R)- 
reticuline (tables S2 and S3). 

To establish whether the STORR locus is not 
only transcribed but also translated as a fusion 
protein, we used a quantitative mass spectrom- 
etry approach after gel fractionation of crude 
protein extract from HM2. Peptides from across 
the entire STORR protein were found to be 
most abundant in the gel regions covering the 
100.65-kD predicted size of the fusion protein, 
confirming this as the in vivo form (Fig. 2). For 
direct functional characterization, the STORR 
fusion protein and the separate modules were 
expressed in Saccharomyces cerevisiae, and en- 
zyme assays were performed on soluble extracts 
and microsomal preparations (Fig. 3). We found 
that 1,2-dehydroreticuline is converted to (R)- 
reticuline with 100% conversion efficiency by both 
the STORR fusion protein and the oxidoreductase 
module, but not by the CYP82Y2 module plus its 
redox partner (Fig. 3A). In contrast, the CYP82Y2 
module plus its redox partner catalyzed 97% con- 
version of (S)-reticuline to 1,2-dehydroreticuline, 
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Fig. 1. Characterization of opium poppy mutants disruptedin A 
the conversion of (S)- to (R)-reticuline. (A) Schematic showing 
epimerization of (S)- to (R)-reticuline and position of the storr-1, 
storr-2, and storr-3 mutations in the predicted fusion protein. (B) 
Mean + SD capsule reticuline content in the HM2 wild-type 
cultivar and storr mutants (HM2, n = 5; storr-1, n = 12; storr-2,n = 17; 


storr-3, n = 15). DW, dry weight. Reticuline content was verified as (S)-Reticuline 1,2-Dehydroreticuline (F7-Reticuling 
>99.2% (S)-reticuline in all mutants by chiral high-performance storr-2 storr-3—_ storr-1 

liquid chromatography (HPLC) (table S1). (C) 1,2-dehydroreticuline. 834 G/A 1648 G/A 2004 G/A 

(D) All compounds >1% total alkaloids (n = 10) are individually W278" GS550R W668" 
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Fig. 2. Size determination of STORR pro- Gel Peptide ratio: Light/Heavy 
tein in the opium poppy. Protein extracts fraction 0.0 0.5 1.0 15 20 25 


from three stem samples of HM2 wild type 
were fractionated, together with size markers, 
by SDS-polyacrylamide gel electrophoresis 
(SDS-PAGE), and the three lanes were each 
cut into 15 fractions (see horizontal lines on 
one representative lane) to resolve the pre- 
dicted fusion protein and putative individual 
CYP450 and oxidoreductase modules (101, 
65, and 36 kD, respectively). For relative 
quantification, an equal amount of a tryptic 
digest of °N-labeled recombinant STORR 
protein was spiked into the in-gel digest of 
each SDS-PAGE fraction before HPLC mass 
spectrometry (HPLC-MS). Ratios of peak 
areas from extracted-ion chromatograms of 
light (endogenous) to heavy (labeled) versions 
of five peptides from across the STORR 
protein sequence were compared. Ratios of 
normalized peak areas from extracted-ion 
chromatograms were converted to binary 
logarithms for the calculation of means and 
standard errors of the mean. Only measure- 
ments where the respective peptides were 
found in all three biological replicates are 
shown. 
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Fig. 3. Functional characterization of the STORR fusion protein by 
heterologous expression in S. cerevisiae. (A) HPLC-MS analysis of the 
in vitro conversion of 1,2-dehydroreticuline to (R)-reticuline. Crude soluble 
(sol) or microsomal (ms) preparations harboring the empty pESC-TRP 
vector (green), vector containing the oxidoreductase module (purple), an 
opium poppy cytochrome P450 reductase (CPR) redox partner (black), CPR 
+ CYP82Y2 (blue), or CPR + the CYP82Y2-oxidoreductase fusion (red) were 
assayed (21). The solid lines of the HPLC-MS chromatograms show the 
normalized total ion count at a mass/charge ratio (m/z) of 328, corre- 
sponding to 1,2-dehydroreticuline (substrate), whereas the dotted lines 
show the normalized total ion count at m/z 330, corresponding to reticuline 
(product). The inset panel shows the chiral analysis of reticuline: The gray 
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trace is for an (S)- and (R)-reticuline standard, and the purple and red traces 
correspond to reticuline derived by activity of the oxidoreductase and 
CYP82Y2-oxidoreductase fusion, respectively. (B) HPLC-MS analysis of the 
conversion of (S)-reticuline into 1,2-dehydroreticuline and (R)-reticuline. 
Crude microsomal preparations obtained from S. cerevisiae harboring 
expression vector pESC-TRP containing CPR only (black), CPR + a CYP82Y2 
module (blue), or CPR + CYP82Y2-oxidoreductase fusion (red) were 
assayed (21). The solid lines of the HPLC-MS chromatograms show the 
normalized total ion count at m/z 328, corresponding to 1,2-dehydroreticu- 
line, and the dotted lines show the normalized total ion count at m/z 330, 
corresponding to reticuline. The inset panel shows the chiral analysis of 
reticuline, with the same line colors as in the main panel. 


demonstrating that it acts as a 1,2-dehydro- 
reticuline synthase (Fig. 3B). Microsomal prep- 
arations harboring the entire STORR fusion 
protein converted about 20% of the added 
(S)-reticuline to (R)-reticuline, confirming the 
bifunctional role of the protein in performing 
sequential reactions in the epimerization of 
reticuline. Kinetic analysis revealed that the 
microsomal CYP82Y2 module alone and the 
CYP82Y2-oxidoreductase fusion had similar 
Michaelis constant values of 13 and 14 uM for 
(S)-reticuline and 1,2-dehydroreticuline, respec- 
tively (fig. S3). Consistent with the plant mutant 
phenotypes, we found that microsomally ex- 
pressed STORR carrying the storr-2 mutation 
lacks both the P450 and oxidoreductase activ- 
ities, the storr-3 mutation lacks the P450 acti- 
vity but does still exhibit the oxidoreductase 
activity, and the storr-1 mutation has lost the 
oxidoreductase activity but maintains very low 
levels of P450 activity (fig. S4). 

P4.50-redox systems where the P450 enzyme 
is covalently linked to redox partner reductase 
components are well known in both prokaryotes 
and lower eukaryotes (24). In the STORR fusion 
protein, the P450 module is linked to a reduc- 
tase, but rather than functioning as a redox 
partner for the P450, this reductase catalyzes 
the product of the P450 to complete a two-step 
epimerization of (S)- to (R)-reticuline. Other forms 
of bifunctional P450 fusions with oxygenase/ 
peroxidase, hydrolase, and dioxygenase modules 
have been reported to occur in ascomycetes, and 
all of these also appear to catalyze sequential 
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reactions (25-27). A possible explanation as to 
why such fusion proteins evolve is that they fa- 
cilitate efficient channeling of highly unstable 
or reactive intermediates. Evidence for efficient 
substrate channeling in the case of the STORR 
fusion protein comes from the observation that 
microsomal fractions harboring the fusion pro- 
tein directly convert (S)- to (R)-reticuline, with 
no detectable accumulation of 1,2-dehydroreticu- 
line (Fig. 3B). 

Phylogenetic analysis suggests that the STORR 
fusion occurred after the split of the CYP82X 
and the CYP82Y subfamilies and that the oxi- 
doreductase module falls into subfamily 4 of 
the aldo-keto reductases, with closest homol- 
ogy to the codeinone reductase family from 
P. somniferum (fig. S5). A query of the 1K plant 
transcriptome resource (28) identified similar 
predicted module arrangements in EST collec- 
tions from two other morphinan-producing 
Papaver species, P. bracteatum and P. setigerum, 
but not in P. rhoeas, which does not make mor- 
phinans (table S4). We hypothesize that the 
STORR fusion was the key step in the evolution 
of the morphinan branch of BIA metabolism. 
After this step, other enzymes were recruited 
and adapted, including dioxygenases and re- 
ductases, ultimately giving rise to codeine and 
morphine in P. somniferum and P. setigerum 
(29). Thus, this morphinan biosynthetic path- 
way, and probably other plant secondary meta- 
bolic pathways, depends on the organization 
of both individual gene structure and genome 
rearrangement (22). 
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CIRCADIAN RHYTHMS 


Atomic-scale origins of slowness in 
the cyanobacterial circadian clock 


Jun Abe,’* Takuya B. Hiyama,’* Atsushi Mukaiyama,”?* Seyoung Son,?*+ 
Toshifumi Mori,”*** Shinji Saito,»”** Masato Osako,? Julie Wolanin,’* Eiki Yamashita,° 


Takao Kondo,’ Shuji Akiyama””+ 


Circadian clocks generate slow and ordered cellular dynamics but consist of fast-moving 
bio-macromolecules; consequently, the origins of the overall slowness remain unclear. 
We identified the adenosine triphosphate (ATP) catalytic region [adenosine triphosphatase 
(ATPase)] in the amino-terminal half of the clock protein KaiC as the minimal pacemaker 
that controls the in vivo frequency of the cyanobacterial clock. Crystal structures of the 
ATPase revealed that the slowness of this ATPase arises from sequestration of a lytic water 
molecule in an unfavorable position and coupling of ATP hydrolysis to a peptide isomerization 
with high activation energy. The slow ATPase is coupled with another ATPase catalyzing 
autodephosphorylation in the carboxyl-terminal half of KaiC, yielding the circadian 
response frequency of intermolecular interactions with other clock-related proteins that 
influences the transcription and translation cycle. 


ircadian clocks comprise suites of biological 
processes that oscillate with a 24-hour pe- 
riod (1). Clock genes and clock proteins 
are present in prokaryotes and eukaryotes 
(2, 3); together, they constitute feedback 
loops that effect transcriptional and transla- 
tional oscillations (TTOs). The origin of the slow 
circadian time scale is thought to be the time 
delay between clock gene transcription and feed- 
back signals that regulate it; however, the tran- 
scriptional and translational events can occur 
quickly (i.e., within minutes) (4). Posttranslational 
oscillations (PTOs) (5-7) in biochemical modi- 
fications of clock proteins occur even without 
transcriptional and translational regulation. 
Proteins generally exhibit dynamics within pico- 
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seconds or seconds (8), much faster than the 
circadian time scale. Thus, both TTO and PTO 
circadian systems are assembled from building 
blocks with intrinsically fast dynamics, raising 
questions about how and why the systems are 
so slow and stable overall (9). 

The cyanobacterium Synechococcus elongatus 
PCC7942 is the simplest organism known to have 
both TTOs (10) and PTOs (5). The S. elongatus 
PTOs can be reconstructed in vitro by incubat- 
ing a core clock protein, KaiC, with two other 
clock proteins, KaiA and KaiB, and adenosine 
triphosphate (ATP) (6). The rhythmic behaviors 
of the Kai oscillator have been confirmed in 
many functional and structural analyses, which 
have probed the ATP hydrolysis [adenosine tri- 
phosphatase (ATPase)] activity of KaiC (71, 12), 
autophosphorylation and autodephosphoryla- 
tion activities of KaiC (6, 13, 14), conformational 
transitions of the proteins (12, 15, 16), and as- 
sembly or disassembly of Kai complexes (77-20). 
Because the PTO period in S. elongatus is firmly 
correlated to the TTO period during the day 
(5, 6), the in vitro Kai oscillator should enable 
us to identify the mechanisms underlying the 
slowness of the circadian clock. 

We searched the Kai oscillator for a minimal 
slow reaction whose efficiency correlated with 
in vivo TTO frequency (Fig. 1) (see supplementary 
materials and methods). The ATPase activity 
of full-length wild-type KaiC (KaiC-WT), con- 


sisting of the N-terminal C1 and C-terminal C2 
domains, has been proposed as the basic timing 
cue (11, 12). We identified the steady-state ATPase 
activity of the C1 domain (Cl-ATPase) as a suit- 
able slow reaction. A truncated version of KaiC 
consisting solely of the N-terminal domain, KaiCl- 
WT, exhibited much slower ATP hydrolysis (11 + 
1 ATP per day per KaiC at 30°C) than well-known 
motor proteins (10? to 10’ day”) such as myosin, 
kinesin, and F,-ATPase (table S1). We confirmed 
the correlation of this activity with the in vivo TTO 
frequency using a series of period-modulating 
KaiC mutations in the C1 domain [S”’—P” 
(S157P), S157C, T42S, S48T] (27) (Fig. 1D), in which 
higher steady-state Cl-ATPase in vitro resulted 
in higher-frequency TTOs in vivo. Thus, KaiC 
should experience a certain number of hydrol- 
ysis events per cycle in vivo, and the absolute 
rate of ATPase activity is connected to the cel- 
lular clock’s overall slowness (11, 22). Therefore, 
we examined the structural origin of the slow 
Cl-ATPase and its coupling with TTOs via spatio- 
temporally distinct events, including the intra- 
molecular KaiC ATPase and its phosphorylation 
cycles, as well as intermolecular interactions with 
KaiA and KaiB (Fig. 1). 

To this end, we crystallized KaiC1-WT and five 
period-modulating variants in the pre- or posthy- 
drolysis states, or both. All resultant crystals were 
in the P2,2,2, space group (Fig. 2, A and B, fig. S1, 
and table S2). The prehydrolysis states (Fig. 2A, 
blue subunits) exhibited common features: assem- 
bly of six subunits into a hexamer and incorpo- 
ration of one molecule of the slowly hydrolyzed 
ATP analog adenosine 5’-(y-thiotriphosphate) 
(ATP-y-S) into every subunit-subunit interface. 
ATP-y-S existed in a complex with a Mg* ion (Mg- 
ATP-y-S) in the ordinary octahedral-coordination 
geometry (fig. S2A). We observed the posthydrol- 
ysis state of the long-period variant KaiCl-S48T 
(Fig. 2B, orange and green subunits), which crys- 
tallized as an asymmetrically ATP- or adenosine 
diphosphate (ADP)-bound hexamer (fig. S2B). 
The ring-shaped hexamer was deformed asym- 
metrically due to steric constraints resulting from 
close juxtaposition of three types of subunits, creat- 
ing both tight and loose intersubunit interfaces 
(Fig. 2B, fig. S3, and supplementary text). 

We identified two structural sources of slow 
ATPase activity. The first is the regulatory influ- 
ence of the protein moiety on a lytic water mol- 
ecule (WI) near the phosphorus atom (P,) of the 
y-phosphate group of an ATP. In the prehydrol- 
ysis state (blue dotted box in Fig. 2C), W1 was 
sequestered (with a WI-P, distance of 3.8 to 
3.9 Aand a W1-P,-Osg angle of 154° to 158°) by 
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H-bonding to the F199 carbonyl oxygen, the 
nitrogen atom of R226 side chain (N,), and 
another water molecule (W2). This position of W1 
was much farther from a near-in-line configu- 
ration, with respect to the P,-O3, bond (3.0 A 
180°), than analogous water molecules in other 
motor proteins that hydrolyze ATP efficiently— 
e.g., kinesin (3.3 A, 164° to 167°) (23), myosin 
(2.3 A, 170°) (24), and F,-ATPase (3.1 to 3.8 A, 
168° to 172°) (25) (Fig. 2D and table S1). Molecu- 
lar dynamics simulation suggested that W1 
was only marginally stabilized. Although a water 
molecule might occasionally migrate into and 
out of the W1 position, the near-in-line position 
was completely inaccessible due to steric hin- 
drance from the F199 carbonyl oxygen and the 
R226 N, (Fig. 2E and movies S1 to $3). Thus, 
the sequestration of W1 from the near-in-line 
position is one atomic-scale origin of slow hy- 
drolysis. In another crystal form of KaiC1-WT 
(P3,21 space group; ocher subunits in Fig. 2F and 
fig. SIA), W1 was even farther from the P, of the 
y-phosphate group, which should be even more 
unfavorable to hydrolysis than the W1 position 
that we observed in the P2,2;2; space group. In 
yet another crystal form, more or less unfavorable 
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Fig. 1. trans-hierarchical nature of the cya- 
nobacterial circadian clock. (A to J) Scatter- 
plot matrix showing all pairwise correlations 
among the frequency of the in vivo TTO cycle 
(10), frequency of the in vitro phosphorylation 
cycle of the Kai oscillator, undamped natural 
frequency (m,) of KaiC alone, steady-state 
ATPase in the full-length KaiC-WT background, 
and steady-state C1-ATPase in the KaiC1-WT 
background. r, correlation coefficient. 
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positioning of W1 occurred asymmetrically within 
the hexamer (fig. SIB). On the basis of nine struc- 
tures (Fig. 2F, fig. S1, and supplementary text), 
we identified S157P and Q153A as mutations that 
indirectly stabilized the less unfavorable position- 
ing of WI (blue subunit), resulting in higher 
C1-ATPase activity (Fig. 1J). By contrast, S157C 
stabilized the more unfavorable positioning of 
W1 (ocher subunit) through N-terminal fray of 
a7 (fig. S4) and repositioning of a6, resulting in 
lower C1-ATPase activity (Fig. 1J). 

The second origin of slowness involves cou- 
pling of ATP hydrolysis to slow cis-trans isomeriza- 
tion of the peptide chain. In the prehydrolysis 
state, the peptide bond between D145 and S146 
(D"’s"* peptide) mainly adopted the cis confor- 
mation (Fig. 3A). By contrast, in the posthydroly- 
sis state, the D““°S™* peptide was entirely in the 
trans conformation (Fig. 3A and fig. S5A). This 
trans selectivity was achieved through hydrolysis 
of ATP bound in the counterclockwise (CCW) 
interface (Fig. 2C, dotted boxes: blue — orange —> 
green; see also supplementary text), along with 
repositioning of helices «6 and «7 on the clock- 
wise (CW) side (Fig. 3A). These structural obser- 
vations support the idea that hydrolysis of ATP 
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bound in the CCW interface results in a confor- 
mational change that forces the D'4°S™® peptide 
to adopt the trans conformation. 

According to our computational results, the 
prehydrolysis state containing the cis-D'*°s'*° 
peptide must overcome a barrier of 14 to 16 kcal 
mol! (Fig. 3B) for cis-trans isomerization and 
a barrier of 11 to 17 kcal mol”! (table S1) to 
disrupt the P,-O3, bond. These two events are 
potentially related to each other through re- 
positioning of helices o6 to o8 (fig. S5G and 
supplementary text). Given a frequency factor of 
~10” s", the rate of a reaction that must over- 
come a barrier of ~22 kcal mol” is ~0.5 hours * 
(~12 ATP day”) at 30°C. The impact of cis-trans 
flipping was confirmed using the KaiCl-S146P 
mutant, in which the D'“°S™® peptide is forced 
to adopt the cis conformation (fig. S5B). More- 
over, full-length KaiC-S146P exhibited a nota- 
ble decrease in steady-state ATPase activity 
(4.8 ATP per day per KaiC), as well as a nearly 
50% increase in the phosphorylation cycle pe- 
riod (34 hours). Thus, cis-trans flipping of the 
D"°s™° peptide could impose a substantial en- 
ergy barrier on the hydrolysis-coupled transition 
of the subunits. 
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In full-length KaiC, the slow C1-ATPase is in- 
tegrated with another ATPase in the C2 domain 
(C2-ATPase), forming a coupled Cl-C2-ATPase 
system. At steady state, the Cl-ATPase predomi- 
nated over the C2-ATPase (Fig. 1J). The pacemak- 
ing role of Cl-ATPase became obvious during the 
approach to the steady state (pre-steady state). 
Immediately after the addition of excess ATP to 
full-length apo-KaiC protein (26), nascent KaiC- 
WT hydrolyzed 35 ATP per day per KaiC at 30°C 
(Fig. 4A and fig. S6A). The activity suddenly de- 
creased to ~8 ATP per day per KaiC during the 
first 6 hours and then stably recovered to and 
remained at 13 + 1 ATP per day per KaiC, the ac- 
tivity required for steady-state KaiC-WT (17). The 
biphasic-exponential phases reflected the pre- 
steady-state relaxation of the coupled C1-C2- 
ATPase (supplementary text). Minimal dependen- 
cies on ATP, ADP, and KaiC protein concentrations 
(fig. S6B) further supported the hydrolysis-related 
maturation of ATP- or ADP-bound KaiC-WT hex- 
amer (supplementary text). The phosphorylation 
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state of the C2 domain was only weakly associated 
with the slow process, as the phosphorylation- 
mimic KaiC-S431D/T432E exhibited detectable 
relaxation (fig. S6C). 

Response speed of the Cl-C2-ATPase is con- 
trolled by the C1-ATPase. Long-period C1 mutants 
(T42S, S157C, S48T, and A251V) with suppressed 
C1-ATPase activity exhibited slower relaxations 
of the Cl-C2-ATPase than KaiC-WT, whereas short- 
period C1 mutants (Q153A and S157P) with elevated 
Cl1-ATPase activity exhibited more rapid relaxation 
with a deeper undershoot (Fig. 4B). These obser- 
vations suggested that steady-state C1-ATPase 
activity governs the speed of pre-steady-state 
relaxation of the Cl-C2-ATPase and oscillation 
period in vitro (Fig. 1G) and in vivo (Fig. 1D). The 
undamped natural frequency (@,) is another mea- 
sure of the response speed of the coupled C1-C2- 
ATPase, as determined by fitting each transient 
curve of ATPase activity (Fig. 4B) to the initial- 
state response curve of the quasi-second-order 
system (supplementary text). Furthermore, the @, 
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value of KaiC-WT alone in vitro was 0.91 + 0.01 
day’, matching the circadian oscillatory fre- 
quency. We confirmed fine correlations among 
steady-state C1-ATPase activity, w, value, and 
in vivo frequency of TTOs for a series of the 
period-modulating C1 mutations (Fig. 1, B, D, 
and I). Thus, the slow C1-ATPase contributes 
to the circadian pacemaker for both the C1-C2- 
ATPase system and TTOs. 

Slow relaxation of the C1-C2-ATPase was re- 
lated to emergence of temperature compensa- 
tion in full-length KaiC. Early ATPase activity 
(<2 hours in Fig. 4C) decayed in a temperature- 
dependent manner (thermal sensitivity Qi5° = 1.4) 
(inset of Fig. 4C), whereas steady-state activity 
after the later decay (>24 hours) was almost in- 
dependent of temperature (Qij” = 1.1). Because 
the contribution of C2-ATPase was negligibly 
small at steady state (Fig. 1J), the activity that 
did not vary with temperature can be attributed 
to the Cl-ATPase. Thus, steady-state Cl-ATPase 
activity may be kept constant in each hexamer 
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Fig. 2. Reaction cycle of the C1-ATPase, a slow but stable ATPase that serves as the basic timing 
cue. (A) Schematic overview of the crystal structure of ATP-y-S—bound KaiC1-WT (P2,2)2)). The drawing 
in the box to the right of (B) summarizes the nomenclature used to describe the structures. For the seven 
other crystal structures, refer to details in fig. S1. (B) Schematic overview of the crystal structure of 
adenylyl imidodiphosphate (AMP-PNP), or ADP-bound KaiC1-S48T (P2)2)2;). (©) Zoomed-in views of the 
prehydrolysis A-B interface (blue dotted box) of Fig. 2A, posthydrolysis B-C interface (orange dotted 
box) of Fig. 2B, and posthydrolysis C—D interface (green dotted box) of Fig. 2B. The mesh indicates the 
Fobs — Fealc omit map of a potential lytic water molecule (W1) and another water molecule (W2), contoured 
at 40. (D) Schematic drawing of W1 positioning (filled circles) in KaiC1-WT, along with motor proteins that 
hydrolyze ATP efficiently (table S1). (E) Distances of a potential lytic water, the carbonyl oxygen atom of 
F199, and N, of R226 from the putative near-in-line position during a 10-ns molecular dynamics simulation. 
The inset describes the crystal structure of the prehydrolysis state of KaiC1-WT, with van deer Waals radii 
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by adjusting the ratio of prehydrolysis blue and 
ocher subunits and by scrambling their spatial 
arrangement (Fig. 2A and fig. S1, A and B). In 
this respect, oscillation of ATPase activity (Fig. 
4A) is realized by perturbing the intramolecular 
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homeostasis of KaiC ATPase via intermolecular 
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KaiC-WT were modulated in a biphasic and par- 
abolic manner on the same time scale as relax- 
ation of the C1-C2-ATPase (fig. S7, A and B). 
However, such time evolution was lost in KaiC- 
$431D/T432E (fig. S7, A and B), which retained 
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tide. (A) ATP hydrolysis in the CCW interface, 
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Fig. 4. Circadian periodicity determined by KaiC 
ATPase activity. (A) Damped oscillation observed 
for pre-steady-state relaxation of KaiC ATPase as 
the source of the circadian time scale at 30°C. In the 
presence of both KaiA and KaiB, the ATPase activity of 
KaiC-WT exhibits a stable oscillation around its steady- 
state ATPase activity. The bar at lower left indicates the 
maximal contribution from ATP synthesis during the first 
hour (27). (B) Dynamic responses of the ATPase activity 
observed for period mutants of full-length KaiC. Each 
curve is offset longitudinally for clarity of presentation. 
The values in parentheses represent the periods of 
in vitro phosphorylation rhythms. (C) Temperature de- 
pendency of pre-steady-state dynamics of ATPase ac- 
tivity in KaiC-WT. The insets represent Arrhenius plots. 
Errors in values are derived from a linear regression analy- 
sis. EA represents the apparent activation energy. 
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pre-steady-state relaxation of C1/C2-ATPase (fig. 
S6C). Thus, C1-C2-ATPase status is coupled to 
intermolecular interactions with KaiA and KaiB 
through a phosphorylation-dependent confor- 
mational change of KaiC. This idea is also sup- 
ported by the fact that the C2-ATPase is essential 
for the complete autodephosphorylation (27). 
Thus, absolute slowness encoded in the Cl-ATPase 
is transferred to and correlated with the TTO cy- 
cle (Fig. 1) through a pathway in which non-steady- 
state relaxation of C1-C2-ATPase leads to cycles 
of phosphorylation that alter intermolecular in- 
teractions with KaiA or KaiB or other clock- 
related proteins (28, 29). 

In cyanobacteria, slow homeostatic regulation 
of the coupled Cl1-C2-ATPase is critical for cir- 
cadian periodicity (Figs. 1 and 4A) and also has 
an important role in entrainment of individual 
KaiC molecules by external stimuli such as tem- 
perature changes (30). Our results suggest how 
ancient cyanobacteria have incorporated Earth’s 
rotation period into their molecular systems. 


REFERENCES AND NOTES 

1. C.S. Pittendrigh, Annu. Rev. Physiol. 55, 17-54 (1993). 

2. D. Bell-Pedersen et al., Nat. Rev. Genet. 6, 544-556 (2005). 

3. J. S. Takahashi, H. K. Hong, C. H. Ko, E. L. McDearmon, 

Nat. Rev. Genet. 9, 764-775 (2008). 

4. B. Schwanhausser et al., Nature 473, 337-342 (2011). 

5. J. Tomita, M. Nakajima, T. Kondo, H. Iwasaki, Science 307, 
251-254 (2005). 

6. M. Nakajima et al., Science 308, 414-415 (2005). 

7. J.S. O'Neill, A. B. Reddy, Nature 469, 498-503 (2011). 

8. M. Karplus, J. Phys. Chem. B 104, 11-27 (2000). 

9. S. Akiyama, Cell. Mol. Life Sci. 69, 2147-2160 (2012). 

0. M. Ishiura et al., Science 281, 1519-1523 (1998). 

1. K. Terauchi et al., Proc. Natl. Acad. Sci. U.S.A. 104, 

16377-16381 (2007). 
2. Y. Murayama et al., EMBO J. 30, 68-78 (2011). 
3. T. Nishiwaki et al., EMBO J. 26, 4029-4037 (2007). 
4. M. J. Rust, J. S. Markson, W. S. Lane, D. S. Fisher, E. K. O'Shea, 
Science 318, 809-812 (2007). 

5. Y. G. Chang, N. W. Kuo, R. Tseng, A. LiWang, Proc. Natl. Acad. 
Sci. U.S.A. 108, 14431-14436 (2011). 

6. Y. G. Chang, R. Tseng, N. W. Kuo, A. LiWang, Proc. Natl. Acad. 
Sci. U.S.A. 109, 16847-16851 (2012). 

7. S. Akiyama, A. Nohara, K. Ito, Y. Maéda, Mol. Cell 29, 703-716 
(2008). 

8. H. Kageyama et al., Mol. Cell 23, 161-171 (2006). 

9. X. Qin et al., Proc. Natl. Acad. Sci. U.S.A. 107, 14805-14810 
(2010). 

20. R. Tseng et al., J. Mol. Biol. 426, 389-402 (2014). 

21. Single-letter abbreviations for the amino acid residues are as 
follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; 
|, lle; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gln; R, Arg: 

S, Ser; T, Thr; V, Val; W, Trp; and Y, Tyr. 

22. T. Kondo, in Clocks and Rhythms, B. Stillman, D. Stewart, 
Eds. (Cold Spring Harbor Laboratory Press, New York, 2008), 
pp. 47-55. 

23. C. L. Parke, E. J. Wojcik, S. Kim, D. K. Worthylake, J. Biol. 
Chem. 285, 5859-5867 (2010). 

24. C. A. Smith, |. Rayment, Biochemistry 35, 5404-5417 (1996). 

25. M. W. Bowler, M. G. Montgomery, A. G. Leslie, J. E. Walker, 

J. Biol. Chem. 282, 14238-14242 (2007). 

26. A. Mukaiyama, M. Osako, T. Hikima, T. Kondo, S. Akiyama, 
BIOPHYSICS 11, 79-84 (2015). 

27. T. Nishiwaki, T. Kondo, J. Biol. Chem. 287, 18030-18035 (2012). 

28. M. Hanaoka et al., J. Biol. Chem. 287, 26321-26327 (2012). 

29. H. Iwasaki et al., Cell 101, 223-233 (2000). 

30. T. Yoshida, Y. Murayama, H. Ito, H. Kageyama, T. Kondo, 
Proc. Natl. Acad. Sci. U.S.A. 106, 1648-1653 (2009). 


ACKNOWLEDGMENTS 

Diffraction data were collected at BL44XU in the SPring-8 facility 
under the proposals 2009A6902, 2009B6902, 2010A6502, 
2010B6502, 2011A6602, 2011B6602, 2012A6702, 2012B6702, and 
2013B6700. MX225-HE was financially supported by Academia 
Sinica and National Synchrotron Radiation Research Center 


316 = 17 JULY 2015 + VOL 349 ISSUE 6245 


(Taiwan, China). This work was supported by Grants-in-Aid for 
Scientific Research (25291039, 22687010, and 26102544 to S.A.; 
24000016 to T.K.; 26888019 to T.M.; and 25288011 to S.Saito); 
the Platform for Drug Discovery, Informatics, and Structural Life 
Science from the Ministry of Education, Culture, Sports, Science 
and Technology of Japan; and partly by the Okazaki ORION 
project. The calculations were in part performed at the 
Research Center for Computational Science in Okazaki, 

Japan. S.A. designed the experiments. A.M., J.A., J.W., and 

M.O. conducted ATPase experiments. S.Son screened initial 
crystallization conditions, and J.A. and T.B.H. further refined them. 
S.Son, J.A., T.B.H., and S.A. collected diffraction data and analyzed 
the structures with inputs from E.Y. T.M. and S.Saito conducted 
molecular dynamics simulations. S.A., J.A., T.B.H., A.M., T.M., 
S.Saito, and T.K. drafted the manuscript with input from all authors. 
We declare no conflicts of interest. The atomic coordinates and 
structure factors are deposited in the Protein Data Bank with 


accession codes 4TL8 (KaiC1-WT, P2,2,2;), 4TLO (ATPyS-bound 
KaiC1-S48T), 4TLA (ADP-bound KaiC1-S48T), 4TLB (KaiC1-S146P), 
ATLC (KaiC1-S146G), 4TLD (KaiC1-S157P), 4TLE (KaiC1-S157C), 4TL7 
(KaiC1-WT, P3;21), and 4TL6 (KaiC1-WT, C2). 


SUPPLEMENTARY MATERIALS 


www.sciencemag.org/content/349/6245/312/suppl/DC1 
Materials and Methods 

Supplementary Text 

Figs. S1 to S7 

Tables S1 to S3 

References (31-51) 

Movies S1 to S3 


9 September 2014; accepted 10 June 2015 
Published online 25 June 2015; 
10.1126/science.1261040 


INFLAMMATION 


Neutrophil extracellular traps license 
macrophages for cytokine production 


in atherosclerosis 


Annika Warnatsch, Marianna Ioannou, Qian Wang, Venizelos Papayannopoulos* 


Secretion of the cytokine interleukin-1p (IL-1B) by macrophages, a major driver of pathogenesis 
in atherosclerosis, requires two steps: Priming signals promote transcription of immature 
IL-1p, and then endogenous “danger” signals activate innate immune signaling complexes 
called inflammasomes to process IL-1B for secretion. Although cholesterol crystals are 
known to act as danger signals in atherosclerosis, what primes IL-1f transcription remains 
elusive. Using a murine model of atherosclerosis, we found that cholesterol crystals acted 
both as priming and danger signals for IL-1B production. Cholesterol crystals triggered 
neutrophils to release neutrophil extracellular traps (NETs). NETs primed macrophages for 
cytokine release, activating T helper 17 (T,17) cells that amplify immune cell recruitment in 
atherosclerotic plaques. Therefore, danger signals may drive sterile inflammation, such as that 
seen in atherosclerosis, through their interactions with neutrophils. 


nflammation is critical against infection but 

must be regulated by multiple checkpoints 

to prevent inflammatory disease (1). During 

infection, cytokine transcription is triggered 

by microbial molecules that activate pattern 
recognition receptors (2). Release of mature active 
cytokines requires additional “danger” signals as- 
sociated with host cell damage. Known as danger- 
associated molecular patterns (DAMPs), these 
secondary signals activate NLRP3 and other in- 
flammasomes, promoting cleavage and activation 
of the protease caspase-1 that processes cytokines 
such as interleukin-1f (IL-18) into their mature 
form (3). 

IL-1f plays a critical role in the development of 
atherosclerosis and other inflammatory diseases. 
Because of its low solubility, cholesterol crystallizes 
in circulation and is taken up by monocyte-derived 
macrophages (4-6), activating their inflamma- 
somes to release IL-18 and other proinflamma- 
tory cytokines (7). These molecules recruit myeloid 
cells to the vascular endothelium, where their 
cholesterol content generates obstructive lesions 
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(8). In atherosclerosis and other sterile inflam- 
matory diseases, the endogenous priming signals 
that activate IL-18 transcription prior to inflam- 
masome activation remain unknown. 

IL-1B up-regulates chemokines that recruit 
neutrophils to atherosclerotic lesions (9-11). Neu- 
trophils are implicated in disease (12, 13), but their 
role in pathogenesis remains poorly understood. 
To combat pathogens that evade phagocytosis (14), 
neutrophils release neutrophil extracellular traps 
(NETs) composed of decondensed chromatin and 
antimicrobials (75). NETs are implicated in several 
inflammatory diseases (16), but their pathogenic 
mechanism and role in atherosclerosis are unclear. 

To examine how neutrophils respond during 
atherosclerosis, we investigated the effect of cho- 
lesterol crystals on human blood-derived neutro- 
phils. Cholesterol crystals induced NET formation 
(NETosis) (Fig. 1, A and B) at concentrations re- 
quired to activate the inflammasome (fig. S1, A 
and B) (7) as efficiently as microbes (J4), trigger- 
ing a reactive oxygen species (ROS) burst (fig. S1C) 
and neutrophil elastase (NE) translocation to the 
nucleus (fig. SID), a critical step for NETosis (17). 
NETosis depended on ROS, as it was blocked 
by diphenylene iodonium [DPI; an inhibitor of 
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reduced nicotinamide adenine dinucleotide phos- 
phate (NADPH) oxidase] or an inhibitor (NEi) 
of the neutrophil-specific proteases NE and 
proteinase 3 (PR3) (17) but not by Cl-amidine, 
which inhibits peptidylarginine deiminase (PAD) 
enzymes implicated in NETosis (78) (Fig. 1, A 
and B, and fig. SIE). Consistently, DPI or NEi 
blocked NE translocation to the nucleus driven 
by cholesterol (fig. S1D) (19). 

Next, we examined whether NETs form during 
atherosclerosis. Previous studies reported the pres- 
ence of NETs in lesions but showed intact neu- 
trophils with condensed nuclei (20) or luminar 
rather than lesion-associated neutrophils in the 
absence of specific NET markers (27). We de- 
tected NETs as large amorphous extracellular 
structures in atherosclerotic lesions from apo- 
lipoprotein E (ApoE)-deficient mice that were 
placed on a high-fat diet (HFD) for 8 weeks to 
induce hypercholesterolemia (Fig. 1C). NETs formed 
in cholesterol-rich areas but were absent from 
adjacent adventitia (fig. S2A). To block NETosis 
in vivo, we crossed ApoE-deficient animals with 
mice deficient in PR3 and NE, because deleting 
both enzymes may abrogate NETosis more effec- 
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tively (22). NETs were completely absent in le- 
sions of ApoE/PR3/NE-deficient mice after 8 weeks 
on HFD (Fig. 1C) and ApoE-deficient mice after 
6 weeks on HFD receiving deoxyribonuclease 
(DNase), which degrades NETs (23) (fig. S2B). 

Subsequently, we assessed the effect of NET 
deficiency on atherosclerosis. When placed on 
HFD, ApoE- and ApoE/PR3/NE -deficient mice ex- 
hibited similar weight gain (fig. S3A) and blood 
cholesterol, triglyceride, and low-density lipo- 
protein (LDL) concentrations (fig. S3B). Analysis 
of aortic root cross sections showed that the two 
groups were modestly different after 4 weeks on 
HFD (fig. S3, C and D), suggesting that NETs did 
not play a critical role early during atherogenesis. 
However, after 8 weeks on HFD, ApoE/PR3/NE- 
deficient mice exhibited a factor of 3 reduction in 
plaque size relative to ApoE-deficient controls 
(Fig. 2, A and B). These differences were also 
reflected by en face analysis of intact aortas (fig. 
83, E and F). DNase injections into ApoE-deficient 
mice on HFD for 6 weeks resulted in a comparable 
factor of 3 reduction in lesion size, which excludes 
the possibility that the proteases played NET- 
independent roles (Fig. 2, C and D). Lesion growth 
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Fig. 1. Cholesterol crystals trigger NETosis. (A) Fluorescence micrograph of neutrophils incubated 
with cholesterol crystals and stained with the lipid dye Dil (magenta) and extracellular DNA (Sytox, 
cyan). Neutrophils were left untreated or treated with NE inhibitor (NEi) or the NADPH oxidase inhibitor 
DPI. Scale bars, 100 um. (B) Quantitation of NETosis in (A). Data are representative of three indepen- 
dent experiments. (C) Representative confocal immunofluorescence microscopy images of aortic root 
sections from ApoE “~ and ApoE/PR3/NE~“~ mice on HFD for 8 weeks and stained for MPO (cyan), 
citrullinated histone 3 (Cit-H3, yellow), chromatin (magenta), and DNA (DAPI, blue). Borders between 
the adventitia (A) and the lesion (dotted line) and lumen (L) are shown; scale bars, 50 um. The third 
row shows detail from the first row (arrow); scale bars, 20 um. Data are representative of eight mice 
analyzed per strain from two independent experiments. 
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was unaffected by DNase in ApoE/PR3/NE-deficient 
mice that lack NETs. 

NET-deficient mice exhibited a reduction in 
lesion growth that was comparable to mice lacking 
NLRP3 or the IL-1 receptor; this finding suggested 
that NETs may drive atherosclerosis by modulating 
cytokine production. Indeed, IL-la, IL-1, and IL-6 
were elevated in the plasma of ApoE-deficient ani- 
mals after 8 weeks on HFD but were largely absent 
in ApoE/PR3/NE-deficient mice (Fig. 3A and fig. 
S4A). After 16 weeks on HFD, IL-la but not IL- 
18 concentrations were still elevated in ApoE- 
deficient controls relative to NET-deficient animals 
(fig. S4B). DNase administration abrogated plasma 
cytokine concentrations in ApoE-deficient controls 
but had no effect on ApoE/PR3/NE-deficient mice 
(fig. S4C). Furthermore, IL-18 staining, which de- 
tects both immature and mature protein, was 
prominent in lesions from ApoE-deficient mice 
but was absent in ApoE/PR3/NE knockout ani- 
mals and colocalized with NETs and macrophages 
(Fig. 3B and fig. S4D). By contrast, IL-18 concen- 
trations were similar in the spleen or in blood 
mononuclear cells (fig. S4E), although we cannot 
exclude that systemic cytokines were not produced 
differentially elsewhere. In addition, IL-18 mRNA 
concentrations were significantly reduced in aortas 
from ApoE/PR3/NE-deficient mice (Fig. 3C). To- 
gether, these data indicate that NE and PR3 were 
not mediating IL-18 maturation posttranslation- 
ally, and instead suggest that NETs are essential 
for the transcription of proinflammatory cytokines. 

The requirement of NETs for cytokine produc- 
tion, and the proximity of NETs to macrophages 
(Fig. 3D) and IL-18 in lesions (Fig. 3B), prompted 
us to examine whether NETs regulate cytokine 
production by macrophages. We prepared NETs 
from cholesterol crystal-stimulated neutrophils 
(fig. S5A) (24) and investigated their effects on 
CD14-purified, blood-derived human monocytes 
in vitro. Stimulation with NETs or cholesterol 
crystals separately yielded minor increases in 
IL-16 and IL-6 concentrations in culture super- 
natants (Fig. 3E and fig. S5B). In contrast, mono- 
cytes released substantial cytokine concentrations 
when pretreated with NETs and subsequently 
stimulated with cholesterol crystals (Fig. 3E). By 
comparison, neutrophils were not a major source 
of cytokines, as they released negligible concen- 
trations in response to cholesterol crystals (fig. 
S5C). Degradation of NETs by DNase treatment 
(fig. S4A) abrogated cytokine release (fig. S5B), 
indicating the requirement for a DNA moiety. 
Consistently, an oligonucleotide (ODN) antago- 
nist of the pattern recognition DNA receptor Toll- 
like receptor 9 (TLR9) significantly reduced cytokine 
release in NET-treated monocytes, but not mono- 
cytes primed with bacterial lipopolysaccharide 
(LPS) (Fig. 3E). Because TLR9 is not expressed 
in monocytes, these data suggest that DNA is 
important for monocyte activation but that its 
detection is mediated via other DNA receptors 
blocked by ODN. Blocking with oligonucleotide 
did not fully inhibit IL-18 induction, so we rea- 
soned that additional non-DNA NET factors con- 
tribute to monocyte activation. Blocking TLR2 
and TLR4, which bind endogenous proteins, 
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Fig. 2. NETs promote atherosclerosis. (A) Two representative microscopy 
images of aortic root sections from ApoE” and ApoE/PR3/NE“~ mice on 
HFD for 8 weeks and stained for lipid (Oil Red O, red) and hematoxylin. Scale 
bars, 200 um. (B) Quantitation of plaque area relative to the area of the aortic 
lumen from (A); data are representative of 11 mice per strain pooled from two 
independent experiments. Each point is the mean from multiple sections per 
animal. (C) Representative microscopy images of aortic root sections from 
ApoE“ and ApoE/PR3/NE~“~ mice on HFD for 6 weeks and regularly injected 
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Fig. 3. NETs prime macrophages for cytokine release. (A) Plasma levels of 
IL-1B from wild-type (WT), ApoE”, and ApoE/PR3/NE~“ mice on HFD for 
8 weeks, measured by enzyme-linked immunosorbent assay (ELISA) in n = 17 
mice per strain pooled from three independent experiments. (B) Representa- 
tive confocal immunofluorescence microscopy images of aortic root sections 
from eight ApoE” and five ApoE/PR3/NE “~ mice on HFD for 8 weeks and 
stained with the macrophage marker Mac-3 (cyan), IL-1B (magenta), the 
neutrophil marker Ly6G (yellow), and DNA (DAPI, blue) in two independent 
experiments. Dashed line denotes the adventitia (A)—lesion boundary; L, lumen. 
Scale bars, 50 um. (C) Representative IL-18 mRNA levels in aorta of five ApoE 
and four ApoE/PR3/NE“— mice fed HFD for 8 weeks, repeated in two inde- 
pendent experiments and measured by quantitative polymerase chain reaction. 
mRNA levels were normalized to the monocyte/macrophage-specific gene 
lamp2 and expressed relative to levels measured in wild-type mice. A power 
analysis measured 89% power for the difference of means between ApoE “~ and 
ApoE/PR3/NE~“~ mice. (D) Representative confocal immunofluorescence 
microscopy images of aortic root sections from five ApoE ~~ mice on HFD 
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Plaque (Oil Red O) + H&E stain. 
intravenously with 120 U of DNase or vehicle contro! (0.9% NaCl). Stained for 
lipid (Oil Red O, red) and hematoxylin; scale bars, 200 um. (D) Quantitation of 
(C) as in (B). Data are representative of four or five mice analyzed per strain and 
condition. A power analysis revealed 92% power for the difference of means 
between NaCl- and DNase-treated ApoE ’~ mice. Statistics by Student’s t test 
for single comparison and two-way analysis of variance (ANOVA) followed by 


Sidak’s multiple comparison post test for multiple comparisons: **P < 0.01, 
***¥P < 0.001, ****P < 0.0001; n.s., not significant. 
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for 8 weeks and stained with the macrophage marker Mac-3 (cyan), Ly6G 
(magenta), MPO (yellow), and DNA (DAPI, blue). Scale bars, 50 um. Right panel 
is a close-up of the boxed area of the left panel; arrowheads point to macro- 
phages. Scale bars, 20 um. (E) Mature IL-18 (black bars, left y axis) or IL-6 (gray 
bars, right y axis) protein released by CD-14 blood-derived human monocytes 
untreated or treated with LPS or NETs alone or in the presence of cholesterol 
crystals. Where indicated, cells were treated with oligonucleotide inhibitor (ODN; 
10 ug/ml). (F) Whole-cell lysates or cell culture medium from naive CD-14 blood- 
derived human monocytes were treated with NETs or cholesterol crystals, 
analyzed by SDS—polyacrylamide gel electrophoresis, and immunoblotted for 
IL-1B, caspase-1, and actin. (G) IL-18 (left panel) or IL-6 (right panel) MRNA in 
naive CD-14 blood-derived human monocytes or treated with NETs alone (black 
bars) or in the presence of cholesterol crystals (gray bars). Statistics in (A) 
and (C) by two-tailed, unpaired Student's t test for single comparison and 
one-way ANOVA, followed by Tukey's multiple comparison posttest for multiple 
comparisons: **P < 0.01. In (E) and (G), data are representative of three inde- 
pendent experiments across three technical replicates; error bars denote SD. 
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decreased IL-1B release and synergized with 
oligonucleotide inhibition, indicating that both 
protein and DNA moieties are important in NET- 
mediated priming (fig. S5D). The complete abro- 
gation by DNase suggests that the association of 
these moieties is critical. In the absence of cho- 
lesterol, NETs did not induce substantial in- 
flammasome activation, as reflected by the lack 
of caspase-1 and IL-18 maturation (Fig. 3F), which 
were observed only upon costimulation with cho- 
lesterol crystals. Stimulation with NETs also up- 
regulated monocytic cytokine transcripts (Fig. 3G). 
These data are consistent with NETs providing 
priming signals in atherosclerosis. 

IL-18 up-regulates the T cell-derived cytokine 
IL-17, which drives the chemokines CXCL1 and 
CXCL2 to promote neutrophil recruitment dur- 
ing inflammation (9, 25). Both groups of mice 
exhibited comparable numbers of T cells (figs. S6 
and S7), but aortas from ApoE/PR3/NE-deficient 
mice on 8-week HFD contained few IL-17* T cells 
relative to ApoE-deficient controls (Fig. 4, A and 
B). IL-17-producing T cells could not be detected 
in aortas of ApoE’’~ animals after 4 weeks on 
HED (fig. S8, A and B), which suggests that strong 
T cell activation does not precede NET-driven 


inflammation. The blood of NET-deficient mice did 
not exhibit alterations in immune cell populations, 
and circulating IL-17* T cells were absent in both 
groups (figs. S9 and S10). The spleen and lymph 
nodes contained a small IL-17* yé T cell population 
but no IL-17* of T cells (figs. S11 and $12, A and B). 
Furthermore, concentrations of IL-17A, CXCL1, 
CXCL2, and the monocyte chemokine CCL2 were 
also significantly reduced in aortas of ApoE/PR3/NE- 
deficient mice (Fig. 4C). Consistently, we counted 
fewer neutrophils in lesions and adventitia of 
ApoE/PR3/NE-deficient mice, by microscopy (Fig. 
4D) and fluorescence-activated cell sorting (FACS) 
(fig. S7B). Although both groups contained a com- 
parable makeup of immune cells (fig. S7A), ADOE/ 
PR3/NE-deficient animals had significantly lower 
total immune cell counts per aorta (fig. S7B), con- 
sistent with reduced inflammation and smaller 
lesions. Adhesion molecule transcripts were similar 
in aortas from both groups, but differences may 
be difficult to detect because of the patchy lesion 
morphology (fig. S12C). 

Neutrophil recruitment was comparable in the 
skin of wild-type and PR3/NE-deficient animals 
treated with Aldara imiquimod (fig. S13, A and B), 
which drives IL-1 exogenously to promote sterile 
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Fig. 4. NETs drive IL-17 and neutrophil chemokine production in atherosclerosis. (A) Representative 
FACS plot of IL-17 intracellular staining in phorbol myristate acetate (PMA)-restimulated af and yé T cells 
from digested aortas of ApoE “~~ or ApoE/PR3/NE~“~ mice on HFD for 8 weeks. (B) Representative 
number of cells relative to CD45* populations and whole aortas from (A) for three animals per group 
repeated in two independent experiments. (C) IL-17A, CXCL1, CXCL2, and CCL2 concentrations from 
whole aorta samples measured by ELISA; n = 5 ApoE” and n = 4 ApoE/PR3/NE ~~ mice. (D) Number of 
total intact Ly6G-stained neutrophils per aortic root section field of view (FOV) measured in immunostained 
micrographs; eight mice per strain in two independent experiments. Statistics by two-tailed, unpaired 
Student's t test: *P < 0.05, ***P < 0.001, ****P < 0.0001. 
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psoriatic inflammation (26). Hence, the reduc- 
tion in neutrophil recruitment in APOE/PR3/NE- 
deficient lesions was not due to intrinsic defects in 
neutrophil chemotaxis or extravasation. Therefore, 
NET-mediated priming of macrophages promotes 
a self-amplifying IL-1-IL-17 cascade and uncovers a 
mechanism for neutrophils to regulate T helper 17 
(T}717) cells that sustains chronic sterile inflam- 
mation (fig. S13C). 

Our data reveal a requirement of NETs as 
priming cues in vivo and show that NETs are 
substantially more potent in priming cytokines 
than in activating the inflammasome. Although 
other endogenous molecules such as oxidized 
LDL can prime in vitro, their importance in vivo 
has not been demonstrated (7, 27). Interestingly, 
antibodies against oxidized LDL in lesions rec- 
ognize oxidized phospholipids that suppress 
inflammation (28). NETosis may prime more 
efficiently than necrosis (29, 30), as it effectively 
exposes highly decondensed and proinflamma- 
tory DNA (31). 

A recent study using Cl-amidine proposed that 
NETs drive a plasmacytoid dendritic cell (pDC)- 
derived interferon-o. (IFN-o.) autoimmune cascade 
in atherosclerosis via TLR9 ligation (20). However, 
TLR9 deficiency has little effect on atherogenesis 
(32) Moreover, PAD enzymes are expressed in 
many cell types (33) and were dispensable for 
NETosis triggered by cholesterol crystals (fig. SIE). 
Whereas IFN signaling down-regulates IL-1 expres- 
sion, genetic ablation of the IFNa/f receptor yields 
a modest 25% decrease in lesion size (34) because 
IFNs may contribute to pathogenesis via the up- 
regulation of caspases (3, 35, 36). 

NET priming may drive inflammation in other 
NET-associated diseases such as cystic fibrosis 
and rheumatoid arthritis (24, 37). In contrast to 
the broad importance of IL-1 and IL-17, NETs play 
more specialized roles in immune defense (4) 
and NET-deficient individuals are primarily sus- 
ceptible to localized fungal infection (38). Hence, 
blocking NETosis or degrading NETs may help to 
treat inflammatory diseases. 
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HIV-1 VACCINES 


Protective efficacy of adenovirus/protein 
vaccines against SIV challenges 


in rhesus monkeys 
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Preclinical studies of viral vector-based HIV-1 vaccine candidates have previously shown 
partial protection against neutralization-resistant virus challenges in rhesus monkeys. 

In this study, we evaluated the protective efficacy of adenovirus serotype 26 (Ad26) vector 
priming followed by purified envelope (Env) glycoprotein boosting. Rhesus monkeys 
primed with Ad26 vectors expressing SIVsmE543 Env, Gag, and Pol and boosted with 
ASO1B-adjuvanted SIVmac32H Env gp140 demonstrated complete protection in 50% of 
vaccinated animals against a series of repeated, heterologous, intrarectal SIVmac251 
challenges that infected all controls. Protective efficacy correlated with the functionality of 
Env-specific antibody responses. Comparable protection was also observed with a similar 
Ad/Env vaccine against repeated, heterologous, intrarectal SHIV-SF162P3 challenges. 
These data demonstrate robust protection by Ad/Env vaccines against acquisition of 
neutralization-resistant virus challenges in rhesus monkeys. 


espite the urgent need for a safe and ef- 

fective global HIV-1 vaccine, only four 

vaccine concepts have been evaluated for 

protective efficacy in humans during more 

than 30 years (J, 2). In rhesus monkeys, 
vaccine protection has been reported against 
neutralization-sensitive viruses (3), but these 
data failed to predict protective efficacy in hu- 
mans (4), which suggests the importance of 
using neutralization-resistant virus challenges 
for preclinical evaluation of HIV-1 and SIV vac- 
cine candidates. We previously showed that prim- 
ing with adenovirus vectors and boosting with 
poxvirus vectors expressing Env, Gag, and Pol 
resulted in a reduced per-exposure acquisition 
risk after challenges with neutralization-resistant 
SIVmac251, but the majority of these animals 


320 = 17 JULY 2015 « VOL 349 ISSUE 6245 


were infected at the end of the challenge series 
(5, 6). To augment antibody responses, we eval- 
uated the immunogenicity and protective efficacy 
of priming with adenovirus vectors and boost- 
ing with adjuvanted Env gp140 protein against 
SIVmac251 and SHIV-SF162P3 challenges in rhesus 
monkeys. 

We immunized 32 adult rhesus monkeys 
(Macaca mulatta) that did not express the pro- 
tective major histocompatibility complex class I 
alleles Mamu-A*01, Mamu-B*08, or Mamu-B*17 
with adenovirus serotype 26 (Ad26) vectors (7) 
expressing SIVsmE543 Env/Gag/Pol antigens 
(5) followed by either SIVmac32H Env gp140 
protein (8) (Ad/Env; n = 12) or Ad35 vectors (9) 
expressing SIVsmE543 Env/Gag/Pol antigens 
(Ad Alone; 7 = 12), and a control group received 


sham vaccines (Sham; n = 8). Animals in the 
Ad/Env group were primed with 3 x 10”° viral 
particles (vp) Ad26-Env/Gag/Pol vectors (10° vp 
per vector) by the intramuscular route at weeks 
O and 24 and were boosted with 0.25 mg Env 
gp140 with ASO1B Adjuvant System at weeks 52, 
56, and 60. Animals in the Ad Alone group were 
primed with 3 x 10”° vp Ad26-Env/Gag/Pol vec- 
tors at weeks 0 and 24 and were boosted with 
3 x 10°° vp Ad35-Env/Gag/Pol at week 52. One 
control animal died before challenge for rea- 
sons unrelated to the study protocol and was 
excluded from the analysis. 

Binding antibody responses to heterologous 
SIVmac239 Env gp140 were detected by enzyme- 
linked immunosorbent assay (ELISA) (JO) in all 
vaccinated animals after Ad26 priming at weeks 
4 and 28 (Fig. 1A). In the Ad/Env group, ELISA 
end-point titers increased from 5.3 logs at week 
28 to 6.4 logs after the SIV Env gp140 boosts at 
week 64 (P < 0.0001) (Fig. 1A), which confirmed 
that the Env boost effectively augmented Ad26- 
primed antibody responses. Neutralizing antibody 
(NAb) responses assessed in the TZM-bl cell line 
(11) against tier 1 heterologous SIVmac251_TCLA.15 
and homologous SIVsmE660 CP3C-P-A8 viruses 
also increased significantly after SIV Env gp140 
boosting (fig. S1). NAb responses against tier 2 
viruses were borderline (fig. $1). 

In addition to neutralization, antibodies me- 
diate a wide variety of additional antiviral func- 
tions through their ability to interact with Fc 
receptors, complement, and lectin-like proteins 
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(12, 13). Previous studies showed that antibody- 
dependent cellular phagocytosis (ADCP) (/4) and 
antibody-dependent complement deposition (ADCD) 
responses correlated with protective efficacy in 
rhesus monkeys (6). To perform a comprehensive 
analysis of vaccine-elicited antibody responses, 
we evaluated 150 independent antibody Fe pa- 
rameters by high-throughput antibody profiling, 
including multiple assessments of antibody Fe func- 
tionality [ADCP, ADCD, antibody-dependent cell- 
mediated cytotoxicity (ADCC), antibody-dependent 
NK cell expression of CD107a, interferon-y (IFN-y), 
and the chemokine CCLA1, isotypes, glycosylation, 
complement binding, and Fc receptor binding 
(14-18). Integration of all 3600 data points in a 
principal-component analysis demonstrated that 
the Ad/Env vaccine and the Ad Alone vaccine 
elicited Env-specific antibodies that were pheno- 
typically distinct (P < 0.0001) (Fig. 1B). A loadings 
plot (Fig. 1C) showed the distribution of all mea- 
sured Fc features in the same multidimensional 
space, which demonstrated the specific features 
that drove the separation of antibody profiles 
(red arrows). Partial least-squares discriminant 
analysis (19) revealed that the six antibody Fc 
functions described above nearly completely sep- 


A P<0.0001 


P=0.0007 


Component 2 


e P<0.0001 


4 5 6 7 8 9 10 11 #12 13 
Component 1 


Fig. 1. Humoral immune responses elicited by the Ad26/Env SIV vaccine. (A) S|Vmac239 Env-specific ELISA titers at weeks O, 4, 
28, 64, and 96. Mean log end-point ELISA titers are shown. (B to G) Multidimensional analysis integrating 150 antibody Fc parameters 
per animal demonstrates differences in antibody Fc profiles. (B) Principal component analysis (PCA) of antibody Fc profiles from 
animals immunized with Ad/Env (red) or Ad Alone (blue) vaccines. (C) Loadings plot mirrors the multidimensional space in the PCA 


arated these groups, with the majority of anti- 
body Fe effector functions clustering with the 
Ad/Env-vaccinated animals (Fig. 1D). Univariate 
analyses showed that these antibody Fe func- 
tions were all significantly increased in Ad/Env 
group as compared with the Ad Alone group 
(Fig. 1E), and a combined analysis demonstrated 
that the number of antibody Fc functions was 
significantly greater in Ad/Env-vaccinated ani- 
mals as compared with animals vaccinated with 
Ad Alone (Fig. 1, F and G). These data show that 
the protein boost resulted in a more polyfunc- 
tional antibody Fc effector profile. 

Cellular immune responses measured by IFN-y 
enzyme-linked immunospot (ELISPOT) assays in 
response to heterologous SIVmac239 and homol- 
ogous SIVsmE543 Env/Gag/Pol peptide pools 
were also detected in all animals after vaccination 
(fig. S2). By multiparameter intracellular cytokine 
staining assays, SIV Env gp140 boosting primarily 
expanded Envy-specific IFN-y*CD4* T lymphocyte 
responses in the Ad/Env group, whereas Ad35- 
Env/Gag/Pol boosting substantially expanded 
IFN-y*CD8* T lymphocyte responses in the Ad 
Alone group (fig. S2). Both CD28*CD95" central 
and transitional memory and CD28*CD95" effector 
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memory CD4* and CD8* T lymphocyte responses 
(20, 21) were elicited by both vaccines (fig. $3). 

To evaluate the protective efficacy of these vac- 
cine regimens, all animals were challenged with 
six repeated, intrarectal inoculations with 500 tis- 
sue culture infectious doses (TCID;0) of the het- 
erologous, neutralization-resistant virus SIVmac251 
(5, 22, 23) beginning week 96 (Fig. 2, A and B). All 
control animals were infected by this challenge 
protocol. The Ad Alone vaccine regimen resulted 
in a 75% reduction in the per-exposure acquisition 
risk as compared with controls [1 - (hazard ratio); 
P = 0.039, Cox proportional hazard model], which 
is consistent with our prior studies (5). In con- 
trast, the Ad/Env vaccine regimen afforded a 90% 
reduction in the per-exposure acquisition risk as 
compared with controls (P = 0.001). Moreover, 
50% (6 of 12) of animals in this group also ap- 
peared uninfected at the end of this challenge 
protocol (P = 0.012 compared with controls, chi- 
square test; P = 0.044, Fisher’s exact test) (Fig. 2B). 
Protection in the Ad/Env group was greater than 
that in the Ad Alone group (P = 0.042, chi-square 
test; P = 0.097, Fisher’s exact test). Binding anti- 
body titers [P < 0.0001; correlation coefficient 
(R) = 0.75] and antibody Fc polyfunctionality 
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but shows the distribution of all measured Fc features, which demonstrates the features that drove the separation of antibody profiles 


(red arrows). (D) Partial least-squares discriminant analysis (PLSDA) of antibody profiles from Ad/Env and Ad Alone vaccinees. (E) Univariate analyses of 
antibody functions identified in (D). A composite (F) dot plot and (G) pie chart show the overall functionality of antibody responses elicited by the Ad/Env 
and the Ad Alone vaccines. Error bars reflect SEM. P values reflect Mann-Whitney tests. *P < 0.01, **P < 0.001, ***P < 0.0001. 
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(P = 0.004; R = 0.56) best correlated with pro- 
tection against acquisition of infection, as mea- 
sured by the number of challenges required for 
infection (Fig. 2C). Individual antibody functions 
(ADCP, ADCC, CD107, CCIA) also correlated with 
protection (P < 0.05). 

In the Ad/Env group, plasma viral loads were 
persistently negative in the six protected mon- 
keys for 400 days after challenge (Fig. 2D). Of 
the six infected animals in this group, four ani- 
mals developed measureable chronic set-point 
viremia, whereas two monkeys exhibited tran- 
sient acute viremia and subsequently became elite 
controllers with undetectable plasma viral loads 
(Fig. 2D and fig. S4). In the Ad Alone group, plas- 
ma viral loads were persistently negative in 2 of 
12 monkeys, and chronic viremia developed in 
10 of 12 animals. In contrast, all sham controls 
developed high levels of chronic viremia with a 
median set-point viral load from days 100 to 
400 after infection of 6.03 log copies/ml, which 
was at least 1.65 logs as high as the median set- 
point viral load in the animals in the Ad/Env 
group that became infected (P = 0.035) (fig. S5). 


A — Adenv (N=12) B 


w= Ad Alone (N=12) 
=== Sham (N=7) 
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Fig. 2. Protective efficacy Cc 7 
of the Ad26/Env SIV 
vaccine against repeated, 
intrarectal SIVmac251 
challenges. (A) Number of 
challenges required for 
acquisition of infection in 
each vaccine group. (B) 
Statistical analyses include 
the hazard ratio with 95% 
confidence interval and 
the per-exposure reduction 6 
of acquisition risk in 

each group, with P values 
reflecting Cox proportional 
hazard models and log-rank 
tests. Additional statistical 
analyses include the 
percentage of completely 
protected animals at the 
end of the challenge series, 
with P values reflecting 
chi-square tests and Fisher's 
exact tests. (C) Correlation 
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We previously reported that progressive SIV 
infection correlated with a marked expansion of 
the enteric virome in rhesus monkeys, particu- 
larly for picornavirus reads (24-27). Metagenom- 
ics sequencing of stool samples in the present 
study demonstrated that the enteric virome ex- 
panded by week 28 but not by week 10 in the 
sham controls (P = 0.015) (fig. S6). Both the Ad/Env 
and the Ad Alone vaccines reduced the expan- 
sion of total enteric reads including enteric picor- 
naviruses (P = 0.002 and P = 0.042, respectively) 
(fig. S6). The Ad/Env vaccine also reduced AIDS- 
related mortality as compared with the sham 
controls (P = 0.020) (fig. $7). 

We next investigated whether the vaccinated 
animals that exhibited persistently negative plas- 
ma Viral loads were completely protected by com- 
prehensive tissue analyses, adoptive transfer 
studies, and immunologic assays. We performed 
necropsies on the six protected animals in the 
Ad/Env group, the two protected animals in the 
Ad Aone group, and one of the elite controllers in 
the Ad/Env group at ~400 days after challenge 
(Fig. 2D). All of these animals had negative plasma 


Hazard Ratio 
(95% Cl) 


Per-Exposure P-Value vs | P-Value vs | Complete 
Risk Reduction | Sham! Sham? Protection 


viral loads at the time of necropsy. We assessed 
36 gastrointestinal, lymphoid, and reproductive 
tract tissues per animal (28 tissues in males) by 
ultrasensitive nested quantitative polymerase chain 
reaction or quantitative reverse transcription poly- 
merase chain reaction assays for SIV DNA and 
SIV RNA as previously described (28). Viral DNA 
and RNA were readily detectable in all tissues in 
the elite controller (Fig. 3A, red circles) but not 
in the eight protected animals (Fig. 3A, black cir- 
cles), except for one viral signal in a single animal, 
which is within the range of expected background 
false-positive signals in similar analyses of naive 
animals (28). 

We next performed adoptive transfer studies 
and infused 60 million peripheral blood and lymph 
node mononuclear cells by the intravenous route 
from the eight apparently protected animals and 
the two elite controllers into naive rhesus mon- 
key hosts. Cells from the elite controllers readily 
transferred infection and resulted in plasma 
viral loads of 6.87 to 7.12 log copies/ml in naive 
recipients by day 14 after adoptive transfer (Fig. 
3B, red lines). In contrast, cells from the 
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of log ELISA titers and antibody Fc polyfunctionality at week 64 with the number of challenges required to establish infection. The plotted data reflect only 
vaccinated animals and do not include the sham controls. Values plotted as >6 challenges reflect animals that remained uninfected. Overlapping data 
points are shown as a single symbol. P values reflect Spearman rank correlation tests. (D) Plasma SIV RNA copies/ml over time in infected and protected 
animals in each vaccine group. Red asterisks indicate elite controllers. + indicates mortality. 
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protected animals failed to transfer infection 
(Fig. 3B, black lines). 

Furthermore, the protected animals exhibited 
no increase in Env/Gag/Pol-specific cellular im- 
mune responses after challenge and also no re- 
sponses to Vif, which was not included in the 
vaccine, whereas vaccinated animals that became 
infected developed massive anamnestic Env/Gag/ 
Pol-specific cellular immune responses and pri- 
mary Vif-specific responses (fig. S8). The protected 
animals also exhibited no anamnestic Env-specific 
ELISA antibody responses after challenge. Taken 
together (Fig. 3 and fig. S8), these data strongly 
suggest that the Ad/Env vaccine afforded com- 
plete sterilizing protection in 50% of animals against 
the SIVmac251 challenge protocol and that the 
mechanism of protection involved primary block- 
ing of acquisition of infection. 

To confirm these findings with analogous vac- 
cines expressing HIV-1 immunogens, we used a 


A 


group of 20 rhesus monkeys that had been im- 
munized previously at weeks 0 and 40 with 
Ad26 and Ad5HVR48 vectors expressing mo- 
saic, consensus, or natural clade C HIV-1 Env/ 
Gag/Pol immunogens (29). Two years after Ad 
priming, these animals were boosted six times 
with 0.25 mg HIV-1 clade C C97ZA012 Env gp140 
(10, 30) with the ASOIB Adjuvant System at weeks 
156, 160, 164, 176, 180, and 184 (Ad/Env; 7 = 20). 
A second group of animals received only 0.25 mg 
Env gp140 with ASOIB at the same six time points 
(Env Alone; n = 8), and a third control group re- 
ceived sham vaccines (Sham; 7 = 12). The Ad/Env 
vaccine elicited greater antibody responses than 
did the Env Alone vaccine by ELISA (fig. S9); func- 
tional nonneutralizing antibody assays (6, 14, 15) 
(fig. S9); tier 1 NAb TZM-bl assays (fig. S10); tier 2 
NAb A3R5 assays (fig. S11); and linear peptide 
microarray assays (31, 32), including variable re- 
gion 2-specific responses (33, 34) (fig. S12). Pro- 
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tective efficacy was assessed by six intrarectal 
challenges with 500 TCIDso of the heterologous, 
neutralization-resistant virus SHIV-SF162P3 (6) 
beginning week 196 (Fig. 4, A and B). Although 
the Env Alone vaccine afforded only minimal 
protection, 40% (8 of 20) of Ad/Env-vaccinated 
animals were completely protected against this 
challenge series (P = 0.006 compared with con- 
trols, chi-square test; P = 0.014, Fisher’s exact test) 
(Fig. 4B and figs. S13 to $15). Binding antibody 
titers (P = 0.008) and ADCP responses (P = 0.001) 
correlated with protection against acquisition 
of infection. 

Our data demonstrate the protective efficacy 
of Ad/Env vaccine regimens against SI[Vmac251 
and SHIV-SF162P3 challenges in rhesus monkeys 
and suggest that the Env protein boost improved 
protective efficacy by enhancing the functional- 
ity of vaccine-elicited, Env-specific antibody re- 
sponses. In contrast, DNA/Ad5 vaccines afforded 
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Fig. 3. Tissue analyses and adoptive transfer studies in SIVmac251 challenged animals. 
(A) Ultrasensitive SIV DNA and SIV RNA assays in multiple tissues from one elite controller (red) 
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equivalents are shown. (B) Plasma SIV RNA copies/ml in previously naive recipient monkeys after 
adoptive transfer of 60 million peripheral blood and lymph node mononuclear cells from the two 
elite controllers (red) or the eight apparently completely protected monkeys (black). Only 1 week 
of follow-up was available for one of the protected animals. 
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Fig. 4. Protective efficacy of the Ad/Env HIV-1 vaccine against repeated, intrarectal SHIV-SF162P3 challenges. (A) Number of challenges required for 
acquisition of infection in each vaccine group. (B) Statistical analyses include the hazard ratio with 95% confidence interval and the per exposure reduction of 
acquisition risk in each group, with P values reflecting Cox proportional hazard models and log-rank tests. Additional statistical analyses include the percentage 
of completely protected animals at the end of the challenge series, with P values reflecting chi-square tests and Fisher's exact tests. 
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no protection against SI[Vmac251 challenges (3), 
which reflects the ability of DNA/Ad5 vaccines 
to block only neutralization-sensitive virus clones 
(35). Alphavirus vector priming and Env pro- 
tein boosting afforded partial protection against 
the neutralization-sensitive virus SHIV-SF162P4 
but was not evaluated against neutralization- 
resistant viruses (36). Rhesus cytomegalovirus 
(CMV) vectors failed to block acquisition of in- 
fection but afforded post-infection virologic con- 
trol and eventual viral clearance in about half of 
the animals after SIVmac239 challenges (28, 37). 

The protective efficacy of Ad/Env vaccines 
against acquisition of neutralization-resistant 
virus challenges in rhesus monkeys in the present 
study has important implications for HIV-1 vac- 
cine development and suggests the potential of 
Env protein boosting after Ad vector priming. 
Nevertheless, important differences exist be- 
tween SIV/SHIV infection in rhesus monkeys 
and HIV-1 infection in humans. Clinical efficacy 
studies are therefore required to determine the 
protective efficacy of these HIV-1 vaccine candi- 
dates in humans. 
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CIRCADIAN RHYTHMS 


A protein fold switch joins the 
circadian oscillator to clock output 


in cyanobacteria 


Yong-Gang Chang,’ Susan E. Cohen,” Connie Phong,” William K. Myers,* 
Yong-Ick Kim,” Roger Tseng,”* Jenny Lin,? Li Zhang,’ Joseph S. Boyd,” Yvonne Lee,® 
Shannon Kang,° David Lee,’ Sheng Li,’ R. David Britt,* Michael J. Rust,” 


Susan S. Golden,”® Andy LiWang?”*>*9* 


Organisms are adapted to the relentless cycles of day and night, because they evolved 
timekeeping systems called circadian clocks, which regulate biological activities with 
~24-hour rhythms. The clock of cyanobacteria is driven by a three-protein oscillator 
composed of KaiA, KaiB, and KaiC, which together generate a circadian rhythm of KaiC 
phosphorylation. We show that KaiB flips between two distinct three-dimensional folds, 
and its rare transition to an active state provides a time delay that is required to match the 
timing of the oscillator to that of Earth’s rotation. Once KaiB switches folds, it binds 
phosphorylated KaiC and captures KaiA, which initiates a phase transition of the circadian 
cycle, and it regulates components of the clock-output pathway, which provides the link 
that joins the timekeeping and signaling functions of the oscillator. 


ndogenous circadian (~24-hour) rhythms 
are found in diverse organisms, arising 
as an adaptation to Earth’s persistent 
cycles of night and day (7). To uncover the 
molecular mechanism of a circadian clock, 
we chose the cyanobacterial system because its 
oscillator can be reconstituted in vitro (2). The 
oscillator is composed of only three proteins KaiA, 
KaiB, and KaiC (3), which together generate a 
circadian rhythm of KaiC phosphorylation at 
residues serine 431 (S431) and threonine 432 
(T432) in the CII domain (4, 5). KaiA promotes 
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KaiC (auto)phosphorylation during the subjec- 
tive day (4, 6), whereas KaiB provides negative 
feedback to inhibit KaiA (7, 8) and promotes 
KaiC (auto)dephosphorylation during the sub- 
jective night. KaiB is also involved in regulating 
two antagonistic clock-output proteins—SasA 
(9) and CikA (10), which reciprocally control the 
master regulator of transcription, RpaA (71). 

To determine the structure of KaiB in its 
KaiC-bound state, we used a monomeric var- 
iant of the KaiB-binding domain of KaiC, CI*, 
and a dimeric KaiB variant (12), KaiB*, with en- 
hanced KaiC binding. Dimeric forms of free KaiB 
retain the same tertiary structure in crystals as 
tetrameric forms (13). Free KaiB has been shown 
by x-ray crystallography (14) to adopt a fold 
found in no other protein (15), despite clear se- 
quence similarity with the thioredoxin-like fold 
at the N terminus of SasA, N-SasA (9). For struc- 
tural studies, we used proteins from Thermo- 
synechococcus elongatus (denoted by “), because 
they are more stable than those from Synecho- 
coccus elongatus (16). For functional studies, we 
used proteins from S. elongatus (denoted by °°), 
the standard model for investigating in vivo 
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circadian rhythms (J7). Analytical ultracentrifuga- 
tion experiments indicated that KaiB®* binds to 
cl'** as a monomer with a stoichiometric ratio of 
1:1 (fig. SIA). Secondary chemical shifts of back- 
bone resonances (18) of KaiB** in a complex 
with CI'** (fig. S1) revealed a thioredoxin-like 
secondary structure (BoBoBBa) (79), rather than 
the secondary structure of free KaiB (BaBBaaB) 
found in protein crystals (Fig. 1A). Hereafter, 
we refer to the BaBfpaop form of KaiB as the 
ground state (gsKaiB), and the BaBaBpoa state as 
fold-switched (fsKaiB). Fewer than 10 proteins 
are known to switch reversibly between distinct 
folds under native conditions, and they are col- 
lectively known as metamorphic proteins (20). 
KaiB is the only metamorphic protein known to 
function in biological clocks. 

Along a f strand, side chains typically alter- 
nate t|7|.... In the B4 strand of gsKaiB, the side 
chain pattern is + |-+, where the dash is G89; in 
fsKaiB, G89 lies in the a3 helix. We reasoned 
that a G89A substitution would destabilize §4 in 
gsKaiB but not a3 in fsKaiB (Fig. 1B). A D91R sub- 
stitution should also destabilize gsKaiB. Nuclear 
magnetic resonance secondary chemical-shift 
analysis revealed that, unlike KaiB'*, the two 
single-point mutants had populations of both 
gsKaiB and fsKaiB states, but the double mu- 
tant was =>98% in the fsKaiB state (figs. S2 to 
S6). A structural model of G89A,D91R-KaiB‘* 
determined by CS-Rosetta (27), using chemical 
shifts and backbone amide H‘-'H™ nuclear 
Overhauser effects as restraints, confirmed that 
G89A,D91R-KaiB'* adopted a thioredoxin-like 
fold (fig. S7), similar to that of N-SasA (22). 

The corresponding KaiB variants in S. elongatus, 
G88A-KaiB**, D9OR-KaiB*’, and G88A,D90R- 
KaiB** also promoted the fsKaiB state relative 
to wild-type (WT) KaiB*° (figs. S8 and S9), al- 
though to a lesser extent than in the correspond- 
ing T. elongatus variants. G88A,D90R-KaiB*° 
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formed a complex with CI°™, with near-complete 
binding within 5 min (fig. S10). In contrast, WT 
KaiB® bound CI°*™ marginally, even after 24 hours 
(fig. S11). In vitro oscillation assays showed that 
the KaiB** variants disrupted KaiC** phospho- 
rylation rhythms (Fig. 2A and fig. $12). Amounts 
of KaiC** phosphorylation were larger in the 
presence of D9OR-KaiB® or G88A,D90R-KaiB®* 
than they were with G88A-KaiB®. G88A-KaiB*° 
formed a complex with KaiA® (Fig. 2B), whereas 
the two KaiB** mutants containing D9OR did 
not (fig. S13), which indicated that although 
the D9OR mutation promotes the fsKaiB state, 
it also disrupts binding. Larger amounts of un- 
sequestered KaiA would be expected to lead to 
larger amounts of KaiC phosphorylation, which 
would account for the observed differences in 
the KaiC phosphorylation profiles when the D9OR 
mutants were used. In vivo bioluminescence 
rhythms from cyanobacterial luciferase report- 
er strains harboring xaiB** variants (Fig. 2C) 
were also disrupted, with phenotypes that close- 
ly matched the in vitro phosphorylation patterns 
(Fig. 2A). A chromosomal copy of kaiB* in addi- 
tion to the mutant XazB variants did not restore 
bioluminescence rhythms in vivo, which indi- 
cated a dominant-negative effect of the fold- 
switch mutations (Fig. 2D and fig. S14). KaiB 
fold switch-stabilizing mutants are less abun- 
dant in vivo than WT KaiB (fig. S15), so at equi- 
librium in vivo fsKaiB is probably rare. 
Although KaiB® variants disrupted rhythms 
in vitro and in vivo, each of them restored the 
cell-length phenotype in kazB strains (Fig. 2, E 
and F) in which the SasA-RpaA output path- 
way was hyperstimulated and cell division was 
inhibited (23). These functional variants indi- 
cate that fsKaiB regulates the clock-output 
enzymes SasA (9) and CikA (0, 23). SasA is 
activated when it binds KaiC (24), and SasA 
and KaiB compete for the CI domain of KaiC 


KaiB'® B 
X-ray (PDB: 2QKE) 
secondary structure 


KaiB’® 
(gsKaiB) 


KaiB'** free 


KaiB'®* bound to cl'®* 


(25). Preincubation with the D9OR-KaiB® and 
G88A,D90R-KaiB~ variants inhibited the abil- 
ity of S431E-KaiC* (a KaiC variant that mimics 
phosphorylation at the S431 residue) to trigger 
output signaling through SasA*° (Fig. 2G and 
fig. S16). Also, mixtures of KaiB and KaiC ac- 
tivate the phosphatase activity of CikA that de- 
phosphorylates RpaA (26). Relative to WT KaiB®, 
fsKaiB** variants enhanced CikA** phosphatase 
activity by about threefold in vitro (Fig. 2H and 
fig. S17) and suppressed RpaA** phosphorylation 
in vivo (fig. S18). We propose that fsKaiB forms 
a complex with KaiC that both activates signal- 
ing through CikA and inhibits signaling through 
SasA by outcompeting SasA for binding to KaicC. 
It is likely that fsKaiB interacts with the CikA 
pseudo-receiver domain (PsR-CikA), because 
adding PsR-CikA to the in vitro oscillator short- 
ened the period and reduced the amplitude 
(Fig. 2I and fig. S19). Overexpression of just the 
PsR-CikA domain in cyanobacteria similarly 
shortened the period of bioluminescence rhythms 
(27). Interaction with PsR-CikA was detected 
for G88A,D90R-KaiB®, but not KaiB*° (fig. S20). 
Addition of the PsR domain of KaiA did not af- 
fect phosphorylation rhythms (Fig. 2J), even 
though it has the same tertiary structure as 
PsR-CikA (6, 28). 

fsKaiB/gsKaiB equilibrium constants, K = 
kx,/kg_, were estimated by fitting the kinetics 
of binding of KaiB‘* variants to CI'** (fig. S21). 
Equilibrium constants were larger for G89A- 
KaiB'° (K = 0.13 + 0.02), D9IR-KaiB* (1.2 + 0.1), 
and G89A,D91R-KaiB™ (6.7 + 0.4), relative to 
KaiB’ (0.08 + 0.01). For KaiB‘ variants bind- 
ing S431E-KaiC“, a distinct fast-binding phase 
was followed by slow binding (Fig. 3A). These 
multiphase binding kinetics were reproduced 
in a gsKaiB < fsKaiB fold-switching model (Fig. 
3B), in which KaiB was initially at equilibrium 
between the two folds. The pool of fsKaiB bound 


N-SasAS® 
(PDB: 1T4Y) 


G89A,D91R-KaiB'** 
(fsKaiB) 


Fig. 1. KaiB switches its fold to bind KaiC. (A) Plots of chemical shift-based 
secondary structures of KaiB'**, KaiB'®* + Cl'®* and G89A,D91R-KaiB'** determined 
by TALOS+ (18). Unassigned proline and nonproline residues are indicated by small 


G89A,D91R-KaiB'* 


red and blue dots along the horizontal axis at y = 0. The secondary structures of 


KaiB® and G89A,D91R-KaiB'** are shown for comparison. KaiB'* residues Q52 to 
E56 in the KaiB*** to Cl*®* complex were not assignable, probably owing to exchange 
broadening. Vertical dashed lines are visual guides separating the N-terminal and 


G89A,D91R-KaiB* 
CS-Rosetta (this study) 


secondary structure switching. 


C-terminal halves of KaiB. (B) Structural comparisons of KaiB®, G89A,D91R-KaiB** 
and N-SasA*®. Residues K58, G89, and D91 are highlighted for their roles in fold 
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rapidly upon adding S431E-KaiC (Fig. 3C), fol- 
lowed by a slow gsKaiB — fsKaiB population 
shift. As shown by our computational model 
(Fig. 3B and fig. S22), the slow formation of the 
KaiB-KaiC complex contributes to the delay 
that allows a population of KaiC proteins to be- 
come highly phosphorylated under continued 
stimulation by KaiA. Increasing the rate of KaiB- 
KaiC binding in our model causes the in silico 
phosphorylation rhythm to fail (Fig. 3D), similar 
to that observed in vitro. Comparing the kinetics 
of binding to CI (fig. S21) and full-length KaiC 
(Fig. 3A) by KaiB mutants shows that full-length 
KaiC contributes to the slow phase as well, and 


this is likely due to the CI adenosine triphospha- 
tase (29) exposing the KaiB-binding site upon its 
activation by CI-CII ring stacking (12). 

N-SasA, which also adopts a thioredoxin-like 
fold (22), competes with KaiB for binding KaiC 
(25) (fig. S23). Intermolecular distances for the 
complexes CI‘**-G89A,D91R-KaiB‘™* and CI'**- 
N-SasA“ (figs. S24 and $25) were measured by 
using the pulsed electron paramagnetic reso- 
nance method of double electron-electron reso- 
nance (DEER), in combination with mutagenesis 
studies (figs. S26 to S29). The higher-quality DEER 
data for the CI**-N-SasA“ complex allowed struc- 
tural modeling (Fig. 4A and figs. S30 and S31). 


In this model, the a2 helix of N-SasA binds to the 
B-loop region of the CI domain (residues A109to 
D123), with additional interactions involving the 
CI a-helix that follows. Hydrogen-deuterium ex- 
change mass spectrometry (HDX-MS) data on 
both complexes (Fig. 4, A and B, and figs. S32 
to S35) also suggest that this region of KaiC is a 
common binding site for KaiB and SasA. This 
model is consistent with a report that truncation 
of the B loop abolished binding (25). B-loop trun- 
cation also restored the cell-length phenotype of 
a strain that lacked kaiB (fig. S36). Furthermore, 
a F121A mutation in the B loop of full-length KaiC 
(F121A-S431E-KaiC*®*) abolished KaiC binding 
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Fig. 2. KaiB fold switching regulates oscillator function and clock output. 
(A) In vitro KaiC phosphorylation assays using KaiC*®, KaiA®°, and KaiB*®, 
G88A-KaiB°°, DOOR-KaiB*®, or G88A,D90R-KaiB**. (B) Gel-filtration profiles 
of G88A-KaiB*’, KaiA®°, and G88A-KaiB** + KaiA®®. Peaks (a) to (c) were ana- 
lyzed by SDS-PAGE (fig. S13). (C) Bioluminescence from strains that carry a 
Pxaig luc reporter for circadian rhythmicity. Cells harbored kaiB*°, G88A-kaiB*°, 
D9OR-kaiB*°, or G88A,D90R-kaiB*, or cells with kaiB°* deletion. Time in LL, time 
under low-light conditions. (D) Bioluminescence from strains that carry a Pais 
luc reporter expressing kaiB*°, G88A-kaiB*°, D9OR-kaiB*°, G88A, D9OR-kaiB*®, 
or empty vector, in addition to chromosomal kaiB**. (E) Representative mi- 
crographs of cells expressing kaiB*®, lacking kaiB*°, or harboring G88A,D9OR- 
kaiB°©. Cellular autofluorescence in red. Scale bars, 2.5 um. (F) Histograms 
showing cell-length distributions of strains expressing kaiB*°, AkaiB*°, G88A- 
kaiB°*, D9OR-kaiB**, or G88A,D90R-kaiB** as the only copy of kaiB. (G) SasA*° 
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kinase activities in the presence of S431E-KaiC** and KaiB*’, G88A-KaiB*, 
D90R-KaiB*, or G88A,D9OR-KaiB*. The mixtures were incubated for 2 hours 
before SasA°°, RpaA®* and [y-°*P]ATP (labeled adenosine triphosphate) were 
added. Relative kinase activities compare the mean steady-state amount of 
32B_labeled RpaA** to that of a reaction of S431E-KaiC** alone (n = 4, error bars 
denote SEM). One-way analysis of variance (ANOVA) gives P < 0.001, and **** 
denotes Bonferroni-corrected values (P < 0.001) for pairwise comparisons 
against kinase activity with KaiB** (a = 0.05). (H) CikA phosphatase activity 
toward phosphorylated RpaA in the presence of KaiC** and KaiB**, G88A- 
KaiB**, D9OR-KaiB**, or G88A,D9OR-KaiB** (n = 4—5, error bars denote SEM). 
KaiC** alone or KaiB** alone did not activate CikA phosphatase activity (fig. 
S17). (I) In vitro KaiC** phosphorylation assays as a function of concentration 
of PsR-CikA®*®. (J) Same as (I) except for using PsR-KaiA** instead of PSR-CikA**. 
The black curves in (I) and (J) are identical. 
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to full-length SasA**, KaiB**, and G88A,D90R- 
KaiB® (figs. S37 to S40). HDX-MS data (Fig. 4, A 
and B) indicated that N-SasA and KaiB induce 
long-range perturbations in isolated CI domains 
that, in the context of full-length KaiC, may affect 


Fig. 3. KaiB fold-switching A 
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t = 24 hours—were derived from fitting data after adding S431E-KaiC* in (A) 
to the model shown in (B). Burst-phase error bars show the standard de- 
viation from model calculations by bootstrap resampling the raw data (n = 20). 
kgs values used in these fits were predetermined from analysis of the kinetics 
of binding of KaiB® variants to the isolated Cl*®* domain (fig. $21), a condi- 
tion where we assumed the rate-limiting step in complex formation is due 


A 


intersubunit interactions. Another HDX-MS- 
derived model (30) indicated that KaiB most 
likely binds to CII, on the basis of a calculation 
minimizing the docking energy between KaiC 
and gsKaiB, not fsKaiB. 
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Naturally occurring variations at residue po- 
sitions 58, 89, and 91 of KaiB (fig. S41A) suggest 
that the ancestral KaiB protein had the fsKaiB 
thioredoxin-like fold, and the gsKaiB fold evolved 
later with the circadian clock. In support of 
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only to KaiB fold switching. Error bars for kg, were estimated by bootstrap 
resampling the original data set 500 times. (D) Mathematical modeling of 
KaiC phosphorylation period (black) and probability of stable oscillation 
(purple), as a function of kgs. The black bars indicate the standard deviation 
in the model period from 100 oscillator calculations at each value of kgt, 
with the other parameters randomly varied as described in supplementary 
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Fig. 4. KaiB and SasA bind to similar sites on Cl. (A) An electron paramagnetic resonance)-restrained model of the Cl*®*-N-SasA’® complex. The 
HADDOCK model of the complex with the best score is superimposed on the crystal structure of KaiC’® (PDB ID: 400m). (B) Qualitative structural model of 
the interaction of Cl'®* and fsKaiB (G89A,D91R-KaiB**) based on HDX-MS data and mutagenesis. Dark blue and cyan spheres represent Cl residues whose 
mutations strongly or moderately weaken binding, respectively. Dark blue and cyan ribbons represent protection against H/D exchange upon complex 
formation that are >1.5 and 0.5 to 1.5 standard deviations above the average, respectively, as determined by HDX-MS (figs. S32 to $35). 
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Fig. 5. Model of KaiB fold 
switching as linchpin for the 
cyanobacterial clock. Excur- 
sion of KaiB to the rare fold- 
switch state causes fsKaiB to 
displace SasA for binding to 
KaiC. KaiC-stabilized fsKaiB 
captures KaiA, initiating the 
dephosphorylation phase of 
the cycle. These aspects con- 
trol oscillator period. CikA and 
KaiA compete for binding to 
fsKaiB, which further links 
oscillator function related to 
KaiA and output activity via 
CikA-mediated dephosphoryl- 
ation of RpaA. The competitive 
interactions of fsKaiB with 


KaiB fold switching 


rhythmic gene expression 
cell division 


SasA, and KaiA with CikA, implicate “output components” CikA and SasA as parts of an extended oscillator. 


this notion, a homolog of KaiB from Legionella 
pneumophila, with no known circadian rhythms, 
has an alanyl residue at position 89 and crys- 
tallizes in the fsKaiB fold (31) (fig. S41B). Rare 
excursions of KaiB between two distinct folds are 
essential for a robust circadian period and re- 
ciprocally regulate mutually antagonistic clock 
output-signaling pathways (Fig. 5). 


REFERENCES AND NOTES 


1. J.C. Dunlap, Cell 96, 271-290 (1999). 

2. M. Nakajima et al., Science 308, 414-415 (2005). 

3. M. Ishiura et al., Science 281, 1519-1523 (1998). 

4. H. Iwasaki, T. Nishiwaki, Y. Kitayama, M. Nakajima, T. Kondo, 
Proc. Natl. Acad. Sci. U.S.A. 99, 15788-15793 (2002). 

5. Single-letter abbreviations for the amino acid residues are as 
follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; 
|, lle; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gln; 

R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, Tyr. 

6. S. B. Williams, |. Vakonakis, S. S. Golden, A. C. LiWang, 
Proc. Natl. Acad. Sci. U.S.A. 99, 15357-15362 (2002). 

7. Y. Kitayama, H. Iwasaki, T. Nishiwaki, T. Kondo, EMBO J. 22, 
2127-2134 (2003). 

8. Y. Xu, T. Mori, C. H. Johnson, EMBO J. 22, 2117-2126 
(2003). 

9. H. Iwasaki et al., Cell 101, 223-233 (2000). 

0. O. Schmitz, M. Katayama, S. B. Williams, T. Kondo, 

S. S. Golden, Science 289, 765-768 (2000). 

1. J. S. Markson, J. R. Piechura, A. M. Puszynska, E. K. O'Shea, 
Cell 155, 1396-1408 (2013). 

2. Y. G. Chang, R. Tseng, N. W. Kuo, A. LiWang, Proc. Natl. Acad. 

Sci. U.S.A. 109, 16847-16851 (2012). 
3. R. G. Garces, N. Wu, W. Gillon, E. F. Pai, EMBO J. 23, 
1688-1698 (2004). 
A. S.A. Villarreal et al., J. Mol. Biol. 425, 3311-3324 
(2013). 

5. L. Holm, P. Rosenstrém, Nucleic Acids Res. 38 (Web Server), 
W545-W549 (2010). 

6. |. Vakonakis et al., Proc. Natl. Acad. Sci. U.S.A. 101, 1479-1484 
(2004). 

7. G. Dong, Y. |. Kim, S. S. Golden, Curr. Opin. Genet. Dev. 20, 
619-625 (2010). 

8. Y. Shen, F. Delaglio, G. Cornilescu, A. Bax, J. Biomol. NMR 44, 

213-223 (2009). 

J. L. Martin, Structure 3, 245-250 (1995). 

20. A. G. Murzin, Science 320, 1725-1726 (2008). 

21. S. Raman et al., Science 327, 1014-1018 
(2010). 

22. |. Vakonakis, D. A. Klewer, S. B. Williams, S. S. Golden, 

A. C. LiWang, J. Mol. Biol. 342, 9-17 (2004). 

23. G. Dong et al., Cell 140, 529-539 (2010). 

24, R. M. Smith, S. B. Williams, Proc. Natl. Acad. Sci. U.S.A. 103, 

8564-8569 (2006). 

25. R. Tseng et al., J. Mol. Biol. 426, 389-402 (2014). 


328  17JULY 2015 + VOL 349 ISSUE 6245 


26. A. Gutu, E. K. O'Shea, Mol. Cell 50, 288-294 (2013). 

27. X. Zhang, G. Dong, S. S. Golden, Mol. Microbiol. 60, 658-668 (2006). 

28. T. Gao, X. Zhang, N. B. Ivleva, S. S. Golden, A. LiWang, Protein 
Sci. 16, 465-475 (2007). 


29. C. Phong, J. S. Markson, C. M. Wilhoite, M. J. Rust, Proc. Natl. 
Acad. Sci. U.S.A. 110, 1124-1129 (2013). 

30. J. Snijder et al., Proc. Natl. Acad. Sci. U.S.A. 111, 1379-1384 
(2014). 

31. M. Loza-Correa et al., Environ. Microbiol. 16, 359-381 
(2014). 


ACKNOWLEDGMENTS 


We thank R. Greenspan, J.-P. Changeux, M. Paddock, Y. Shen, 
R. Peterson, S. Chou, and N.-W. Kuo for discussions and A. Chavan 
for figure preparation. Work was supported by Air Force Office 
of Scientific Research grant 13RSLO12 and NIH grant GM107521 
to A.L., a Burroughs-Wellcome Career Award at the Scientific 
Interface to M.J.R., NIH grants GM100116 and GM062419 to 
S.S.G. and Al081982 and Al101436 for support of the University 
of California-San Diego HDX-MS Laboratory, American 

Cancer Society Postdoctoral Fellowship PF-12-262-01-MPC to 
S.E.C., and NSF Graduate Research Fellowship to R.T. The data 
reported in this paper are tabulated in the supplementary 
materials. 


SUPPLEMENTARY MATERIALS 


www.sciencemag.org/content/349/6245/324/suppl/DC1 
Materials and Methods 

Figs. S1 to S41 

Tables S1 to S13 

References (32-58) 


15 August 2014; accepted 8 May 2015 
Published online 25 June 2015; 
10.1126/science.1260031 


SEX DETERMINATION 


foxl3 is a germ cell-intrinsic factor 
involved in sperm-egg fate decision 


in medaka 


Toshiya Nishimura,” Tetsuya Sato,?** Yasuhiro Yamamoto, Ikuko Watakabe,' 
Yasuyuki Ohkawa,” Mikita Suyama,** Satoru Kobayashi,”*** Minoru Tanaka’”+ 


Sex determination is an essential step in the commitment of a germ cell to a sperm or 
egg. However, the intrinsic factors that determine the sexual fate of vertebrate germ 
cells are unknown. Here, we show that fox/3, which is expressed in germ cells but not 
somatic cells in the gonad, is involved in sperm-egg fate decision in medaka fish. Adult XX 
medaka with disrupted foxl3 developed functional sperm in the expanded germinal 
epithelium of a histologically functional ovary. In chimeric medaka, mutant germ cells 
initiated spermatogenesis in female wild-type gonad. These results indicate that a germ 
cell-intrinsic cue for the sperm-egg fate decision is present in medaka and that 
spermatogenesis can proceed in a female gonadal environment. 


n vertebrates, gonadal somatic cells instruct 
germ cells to adopt their sexual fates. In 
medaka (Oryzias latipes), the expression of 
DMY/dmrtibY in the supporting somatic cells 
is critical for the fate decision of germ cells to 
enter spermatogenesis (1, 2). However, the molec- 
ular mechanism underlying the sexual fate deci- 
sion in germ cells remains unknown. In this study, 
we show that foxl3 serves as a germ cell-intrinsic 
cue for sperm-egg fate decision. 
foxl3 is an ancient duplicated copy of foxl2 
that is expressed in gonads of teleost fish (3-5). 
Although jozvl2 is known to be essential for ovarian 
development and maintenance (6-9), the function 
of foxl3 is not known. foxl3 transcripts and FOXL3 
protein (f0x13/FOXL3) were first detectable in germ 


cells of both XX and XY embryos at stage 35, the 
time of onset of gonadal sex differentiation (figs. S1, 
A to D, and S2, A to D). After this stage in XX 
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Fig. 1. Dimorphic expression of FOXL3 during gonadal development. (A) Schematic representation 
of type | and type II division and expression of FOXL3 protein (red) in germ cells (green). “Active” and 
“quiescent” indicate mitotic states. (B) Ventral view of the gonad, observed with confocal laser 
microscopy. FOXL3 expression (red) in XX gonad at 7 days after hatching. FOXL3 signals are detected in 
a subset of type | (1) and all mitotic type II (2, 2-cell cyst; 3, 8-cell cyst) germ cells, but not in meiotic 
germ cells (4, zygotene; 5, pachytene) or diplotene oocytes (6). (©) The number of FOXL3-positive cells 
during gonadal development in XX and XY gonads. Values are expressed as means + SEM. The number 
of samples examined at each stage is shown in fig. S2. st., stage; dph, days post-hatching. Scale bars, 
10 um. 


gonads, germ cells can be categorized into two 
types according to their division (Fig. 1A) (JO, 11). 
Type I is stem-type self-renewal division, in which 
germ cells divide completely to generate two iso- 
lated daughter cells surrounded by supporting 
cells. Type I includes both mitotically active and 
quiescent germ cells (72). Type II is cystic division, 
in which germ cells divide synchronously with 
intercellular bridges. Type II is gametogenesis- 
committed division, followed by meiosis and 
oogenesis. In XY gonads, the transition from 
type I to type II is suppressed until 1 month after 
hatching. 

foxl3/FOXL3 appeared in a subset of mitoti- 
cally active type I germ cells, but not in quiescent 
type I germ cells, at stage 35 onward in both XX 
and XY gonads (Fig. 1, A and B, and fig. $3). In 
XX gonads, the signals continued to be detected 
in type II germ cells but disappeared in meiotic 
germ cells and oocytes (Fig. 1B and fig. $1, E and 
G). Whereas foxl3/FOXL3 were detected through- 
out gonadal development in XX fish, they disap- 
peared in all germ cells of XY fish by 10 days after 
hatching (Fig. 1C and figs. S1 and S2). 

To examine the function of foxl3 in the gonad- 
al sex differentiation, we generated transcription 
activator-like effector nuclease (TALEN)-induced 
mutants of foxl3. We designed two types of 
TALENs targeting foxl3 and obtained two dif- 
ferent mutant alleles (NA17 and FHA8), which 
contain frameshifts causing premature trunca- 
tion upstream of and at the forkhead domain of 
FOXL3, respectively (fig. S4, A to D). Antiserum 
recognizing the C terminus of FOXL3 protein 
detected the protein in both heterozygous (+/-) 
mutants but failed to detect it in both homozy- 
gous (-/-) mutants (fig. S4, E to H), indicating 
that foxl3 was successfully disrupted by TALEN. 
One week after hatching, a stage at which oocytes 
are formed in wild-type XX gonads (Fig. 1B), the 
‘foxl3 ~ XX gonads had no oocytes but were 
instead filled with cystic and meiotic germ cells 
(Fig. 2, A to C), suggesting that commitment to 
type II division and meiosis is not affected by loss 


Fig. 2. foxl3 is responsible for suppressing initiation of spermatogenesis in medaka germ cells. (A to D) Ventral view of a foxl3-’~ XX gonad at 7 days 
after hatching. The white boxes are magnified in (B) to (D). Green cells (olvas—enhanced green fluorescent protein) represent germ cells. Meiotic germ cells at 
zygotene (B) and pachytene [(C), dotted line] stages, and spermatid-like cells (D). Red dots in germ cells represent OLVAS protein. (E) Detection of protamine 


(purple signals) in foxl3-’- XX gonad at 10 days after hatching. Scale bars, 10 um. 
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Fig. 3. Progression of spermatogenesis in a female gonadal environ- 
ment. (A) RT-PCR analysis. Ov, ovary; Ts, testis. (B and C) Cross sections 
of adult gonads with immunohistochemistry. (B) Wild-type ovary. (Inset) A 
germinal cradle, in which a germline stem cell (arrow), cystic germ cells 
(yellow dotted line), and a diplotene oocyte (arrowhead) are surrounded by 
sox9b-expressing cells (red). (C) foxi3-’~ XX gonad. (Inset) Expanded ger- 
minal epithelium in which sox9b-expressing cells (red) surround germ cells 
(green). The yellow arrowhead indicates an oocyte. Scale bars, 50 um. (D to 
G) Cross sections of adult gonads subjected to periodic acid—Schiff staining. 
(D) Wild-type ovary. (Inset) The germinal cradle hetween basement mem- 


brane (red arrowheads) and epithelial cells (black arrowhead) of the ger- 
minal epithelium (yellow dotted lines). (E) foxi3-’~ XX gonad. Spermatogenesis 
progresses within the expanded germinal epithelium (yellow dotted lines) 
that resides between the basement membrane (bottom inset, red arrow- 
heads) and epithelial cells (top inset, black arrowheads). The arrow indicates 
an oocyte in the stromal compartment. Asterisks in (B to E) indicate ovarian 
cavities. [(F) and (G)] Spermatogenesis proceeds as units of cysts [(F), red 
dotted lines] from distal (d) to proximal (—p) position in wild-type testis (F), 
but this arrangement is disorganized in the foxl3-’~ XX gonad (G). Scale 
bars, 10 um. 


Fig. 4. foxl3-’~ XX gonads produce func- A 100 = B 100 
tional sperm. (A) Fertilization rate of _ es 
artificial insemination by using sperm from & 80 & 80 
wild-type testes (n = 3) and foxi3-“~ ovaries 8 s g ES 
(n = 8). In each artificial insemination, one c if = 
gonad was used. (B) Hatching rate of the 2 40. = 40 
fertilized eggs in (A). (C) Unfertilized eggs. 3 cj 
(D) Eggs fertilized by sperm derived from 5 20 = 


foxl3-’~ ovaries. The activated egg mem- 
brane is shown in (D’). (E) Embryos hatched 
from the eggs in (D). Statistics by two-tailed 
Student's t test; *P < 0.05. 


of foxl3. Instead of oocytes, spermatid-like cells 
were present at the periphery of gonads in both 
mutant lines (NAI7 in Fig. 2, A and D, and FHA8 
in fig. S4H); these cells expressed protamine, a 
sperm-specific gene (Fig. 2E and fig. S5). In ad- 
dition, other markers for spermatogenesis such 
as kif17, tektin-t, and shippol were also expressed 
in foxl3- XX gonads (fig. $5). At around the same 
stage in normal XY gonads, only type I germ cells 
are present (fig. S2J), and spermatogenesis does 
not begin until puberty, which occurs later than 
1 month after hatching. This indicates that in 
foul” XX mutants, spermatogenesis begins much 
earlier than puberty in wild-type males. 
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Chimeric analysis revealed that foxl3/- germ 
cells completed spermatogenesis in wild-type XX 
gonads, indicating that the phenotype is due to 
loss of foxl3 function in germ cells but is not 
affected by gonadal somatic cells (fig. S6). Fur- 
thermore, introduction of a bacterial artificial 
chromosome (BAC) containing fox/3 allele restored 
the formation of oocytes in fovl3~ XX larvae (fig. 
S87). Collectively, these data indicate that foxl3 is a 
germline-intrinsic factor involved in sperm-egg 
fate decision in medaka. 

To characterize the gonadal structures pro- 
ducing sperm in the XX mutants, we performed 
in situ hybridization and reverse transcription 


polymerase chain reaction (RT-PCR) using sex- 
specific markers. Throughout gonadal develop- 
ment from 10 to 30 days after hatching and in 
adult gonads of foxl3’/- XX fish, the female 
markers foxl2 and aromatase were expressed in 
somatic cells surrounding spermatogenic cells, 
whereas the male markers dmrtI and p450118 
were absent, reflecting the normal female ex- 
pression pattern in somatic cells (Fig. 3A and 
figs. S8, A to F, and S9, A to F). In adults, the 
secondary sex characteristics, which are distin- 
guished by the morphology of dorsal/anal fins 
and urogenital papilla (73), were of the female 
type in the XX mutants (fig. S10, A and B). Con- 
sistent with this, the foal3’~ XX gonads exhibited 
morphologically ovarian structures characterized 
by an ovarian cavity on the dorsal side and a 
stromal compartment on the ventral side (Fig. 3, 
B and C, and fig. $10, C and D). In wild-type ova- 
ries, a thin multilayered tissue called the germi- 
nal epithelium separates the ovarian cavity from 
the stromal compartment (fig. S11A). In addition, 
the germinal cradle, a unit consisting of sox9b- 
expressing cells that harbor germline stem cells 
and support early oogenesis (Fig. 3B), lies be- 
tween the epithelial cells of the germinal epithe- 
lium and the basement membrane bordering 
the stromal compartment (Fig. 3D and fig. SIIB) 
(14). In foxl3/- XX gonads, spermatogenesis pro- 
ceeded within the expanded germinal epithelium 
(Fig. 3E and fig. S11). Early stages of spermatogenic 
cells were inclined to localize on the basal side of 
the expanded germinal epithelium underlain by 
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the basement membrane and were surrounded by 
sox9b-expressing cells, forming a structure similar 
to the germinal cradles (Fig. 3C and figs. S11 and 
S12). A subset of these germ cells expressed nanos2, 
a marker of germline stem cells (J4), suggesting 
that germline stem cells were established in the 
absence of foxl3 function (fig. S9, G to I. 

In wild-type testes, spermatogenesis proceeds 
synchronously in cysts composed of Sertoli cells 
surrounding spermatogenic cells at the same 
stage. The cysts are arranged along a distal-to- 
proximal axis, so that the spermatogonia are 
located most distally and matured spermatids 
are released into the efferent duct in the most 
proximal position (Fig. 3F). In the expanded ger- 
minal epithelium of foal3’/- XX gonads, how- 
ever, the distal-to-proximal arrangement of the 
cysts was disorganized, so that spermatogenic 
cells at various stages were adjacent to each 
other (Fig. 3G). This observation suggests that 
feminized somatic cells are not able to arrange 
masculinized germ cell cysts in the proper distal- 
to-proximal manner, as Sertoli cells would other- 
wise do in wild-type testes. At the inner side of 
the expanded germinal epithelium, meiotic sper- 
matogenic cells did not always contact sox9b- 
expressing cells (fig. $12). Therefore, it is possible 
that spermatocytes in foal37/- XX gonads could 
differentiate autonomously into sperm within the 
expanded germinal epithelium, as previously re- 
ported occurring in culture (15). Occasionally, 
oocytes appeared in between spermatogenic cells 
in the expanded germinal epithelium (Fig. 30), 
which subsequently exited as follicles into the 
stromal compartment (Fig. 3E), as seen in follicle 
formations of the wild-type ovary (fig. S11) (14). 
Collectively, the expanded germinal epithelium in 
foxl3’'- XX gonads harbors germline stem cells 
and is able to support follicle formation but can- 
not organize the cysts in a distal-proximal direc- 
tion. Any morphological trait of efferent ducts 
was not observed, possibly leading to the large 
accumulation of spermatogenic cells within the 
germinal epithelium of foal3’~ XX gonads. We 
conclude that foals! ~ XX mutants undergo sper- 
matogenesis in histologically functional ovaries. 

On the other hand, foxl3"/" XY adult gonads 
exhibited morphologically normal testes with 
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functional sperm (fig. S10, E to H, L, and M), 
indicating that foxl3 is dispensable for male 
development. In addition, chimeric analysis revealed 
that fouls! ~ germ cells did not initiate preco- 
cious spermatogenesis in wild-type male gonads 
(fig. S6, B and D). These results imply that male, but 
not female, somatic cells are able to properly reg- 
ulate spermatogenesis in the absence of foxl3. The 
absence of male somatic cell regulation may allow 
foxl3 germ cells to complete spermatogenesis 
much earlier than puberty in wild-type males. 

To investigate whether the sperm produced by 
foxl3'- XX mutants were functional, we per- 
formed artificial insemination. The gonads from 
2- or 3-month-old XX mutants were minced with 
forceps to suspend sperm in medaka Ringer’s 
solution, and ovulated eggs of wild-type females 
were subsequently inseminated by the resultant 
sperm. Approximately 50% of the eggs were suc- 
cessfully fertilized (Fig. 4A), as confirmed by the 
activation of the egg membrane (Fig. 4, C and D). 
The fertilized eggs exhibited normal development, 
and more than 95% of them hatched (Fig. 4, B and 
E). Therefore, the foxl3’/- XX mutant produced 
functional sperm. Furthermore, foals! ~ XX mu- 
tants spontaneously spawned a few fertile eggs, 
although their fertility—as assessed by the number 
of spawned eggs, the fertilization rate, and the 
hatching rate—was lower than that of the foul3”! 7 
mutants (fig. S10, I to K). 

The presence of eggs in foxl3/- XX mutants 
suggests the existence of a foxl3-independent 
pathway controlling oocyte formation. The oocytes 
did not appear until around 20 to 30 days after 
hatching (fig. S8, G to I) in the mutants, a stage at 
which an ovarian cavity starts to form because of 
the action of estradiol (E2) (6). However, inhibi- 
tion of aromatase activity and E2 signaling by the 
administration of fadrozole (FAD) and tamoxifen 
(TAM), respectively, did not block oocyte formation 
in the mutants (fig. S13). On the other hand, ad- 
ministration of E2 during the embryonic and larval 
stages of both XX and XY foxl3~ mutants did not 
induce the precocious oocyte formation observed 
in E2-treated fovl3*/” XY larva (fig. S14). These 
findings suggest that after derepression of sperma- 
togenesis upon loss of foxl3 expression, oocytes are 
formed in an E2-independent manner. 


Our findings suggest that suppression of sperm- 
atogenesis by foxl3 is an important component of 
female fate decision in germ cells. In this respect, 
we note that another gene of forkhead box tran- 
scriptional factor, foxl2, is involved in the mainte- 
nance of female fate in the mammalian supporting 
cells by suppressing the male fate (8). 
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The Department of Biochemistry and Molecular Biophysics at Washington 
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research. Applicants seeking tenured positions must have a strong record of 
external funding. 


Outstanding individuals working in any area of biochemistry and molecular 
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spans a wide range of topics including computational biology, membrane 
proteins, molecular motors, nucleic acid / protein interactions, protein 
structure, enzymology and signal transduction. Additional information about 
the department is available at http://www.biochem.wustl.edu. Washington 
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interdisciplinary graduate and medical scientist training programs. Minority 
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disadvantaged populations. 
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(strain accessions, deaccessions, quality control), information management 
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presentations, position papers, manuscripts), and business operations (personnel, 
user accounts, billing, financial recordkeeping and reporting). The new BDSC 
scientist will participate in all aspects of BDSC operations and eventually 
assume a primary leadership role in the organization. Candidates should have 
extensive knowledge of Drosophila genetics, excellent communication skills, 
proven grantsmanship, well-developed organizational abilities, interest in a long- 
term position, and a keen desire to support the Drosophila research community. 


Applicants must hold a Ph.D. degree and have relevant postdoctoral experience 
with a solid record of research accomplishments. Experienced scientists are 
encouraged to apply. All applications received by September 1, 2015 will be 
considered. Applicants should submit a cover letter, CV and the names of three 
references to http://indiana.peopleadmin.com/postings/1603. Please address 
inquiries to Jeremy Bennett at (812) 855-6283; jebennet@indiana.edu; 1001 
E. Third St., Bloomington, IN 47405-7005. 


Indiana University is an Equal Employment and Affirmative Action Employer 
and a provider of ADA services. All qualified applicants will receive 
consideration for employment without regard to age, ethnicity, color, race, 
religion, sex, sexual orientation or identity, national origin, disability status 
or protected veteran status. 


Center for Immunology and 
Microbial Disease 
Albany Medical College 
Faculty Position 


The Center for Immunology & Microbial Disease at Albany Medical 
College invites applications for a tenure-track, junior or senior faculty 
position from individuals who have a doctoral degree, postdoctoral 
experience, and demonstrated research productivity. Those with an 
interest in host-pathogen interactions are particularly encouraged to apply. 
The successful candidate will be expected to establish an independent, 
extramurally-funded research program and participate in the teaching of 
medical and graduate students. The basic science departments at Albany 
Medical College are organized as interdisciplinary research centers and 
the Center for Immunology & Microbial Disease has a focus on microbial 
pathogenesis and immune defense, particularly as related to biothreat 
agents and emerging infections. The new faculty recruit will receive a 
competitive salary, an attractive start-up package, laboratory space in our 
newly constructed research building, and access to all departmental core 
services including the Center’s fully-staffed Immunology and ABSL-3/ 
BSL-3 Cores. Albany Medical College is located in a mid-sized city within 
the upstate New York Capital Region, and has easy access to Boston, New 
York City, and the Adirondack Mountains. 


Applicants should send their curriculum vitae, a statement of research 
plans, and three letters of reference to: 
Faculty Search Committee 
Center for Immunology & Microbial Disease 
Albany Medical College 
47 New Scotland Avenue, MC-151 
Albany, NY 12208 


For further information about the Center, visit: 
www.amce.edu/Academic/Research/imd.htm 


An Equal Opportunity/Affirmative Action Employer. 
Women andminorities are encouraged to apply. 


Scientific Director 
For the Center for Predictive Medicine for 
Biodefense and Emerging Infectious Diseases 


The University of Louisville seeks applications and nominations for the 
position of Scientific Director of its Center for Predictive Medicine for 
Biodefense and Emerging Infectious Diseases (CPM). The Scientific Director 
will work closely with the team responsible for managing the NIH-affiliated 
Regional Biocontainment Laboratory (BSL-2/ABSL-2,BSL-3/ABSL-3) at the 
University of Louisville and will be expected to lead research and educational 
activities that contribute towards meeting its operational and financial 
requirements. The Director will report to the Executive Vice President for 
Research and Innovation and will be appointed to the Faculty of the academic 
college and Department that is appropriate to their education and expertise. 


The ideal applicant will have a PhD, MD, or equivalent degree, a proven 
publication record and an established research program with history of 
funding support. Experience and/or expertise in a regulated environment is 
preferable. Applicants should review information at the CPM website http:// 
louisville.edu/predictivemedicine. Applicants should submit a CV, the names 
and contact information of four scientific references, and a letter describing 
the applicant’s experience in leading a multidisciplinary research program 
that may encompass collaborations with individuals, academic institutions, 
government, industry, contract labs, and/or foundations. In addition, a 
statement that outlines plans for leading the CPM as its Scientific Director 
should be provided. 


Applications should be received by September 15, 2015 for full consideration, 
although applications will continue to be reviewed until the position is 
filled. Nominations and applications will remain confidential until invited 
for campus interview. 


All applications and requested material must be submitted on-line at: https:// 
highereddecisions.com/uofl/current_vacancies.asp Job ID UL374. Use the 
left margin option, View Vacancy Announcement to read the ad, and/or the 
Applicant Guide to apply. 


The University of Louisville is an Affirmative YN|VERSI 


Action, Equal Opportunity, Americans with 
Disabilities Employer. LO U 4 S 
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The space roboticist 


he first motorized vehicle that Vandi Verma ever operated was a tractor. “I must’ve been 
11 years old at the time,” she told Science. During school vacations, she visited her grandpar- 
ents, who lived in a village in central India. At their farm, her uncle let her take a few turns 
behind the tractor wheel. Later, when she was a teenager, her father, who was a pilot with 
the Indian Air Force, taught her how to drive a car. That was unusual in India at that time, 
where those who could afford a car hired a driver. 


Today, Verma is one of the few peo- 
ple in the world who is qualified to 
drive a vehicle on Mars. 

Verma majored in electrical en- 
gineering in India and came to the 
United States to study artificial intel- 
ligence. She was captivated by the 
landing of the Sojourner Mars rover 
in 1997 and decided to apply her en- 
gineering skill to space exploration. 
She pursued a Ph.D. in robotics at 
Carnegie Mellon University and did 
internships with NASA’s Ames Re- 
search Center. She also got her first 
taste of robotic exploration here on 
Earth by field testing a rover that 
surveyed South America’s Atacama 
Desert for signs of life. 


‘Tam happy to be ... pushing 


commands to see the results from 
Curiosity, and every sol (a martian 
day), we were doing something we'd 
never done before,” Verma says. 

Curiosity can scoop dirt, drill rock, 
and hand off samples to the onboard 
lab. While a sample is being ana- 
lyzed, Curiosity is already on its way 
to the next site. Verma helped write 
the code that lets Curiosity juggle 
these tasks. “We have to drive on 
to find newer things for the slew of 
instruments to analyze without com- 
promising the rover hardware or the 
sample,” she says. 

She loves her day-to-day respon- 
sibility for the machine. “You defi- 
nitely don’t want to be the one who 


After graduating, in 2005, Verma the envelope on space drove the rover off a cliff! But I find 
joined the intelligent systems divi- I ° ” it energizing rather than stressful. 
sion at Ames Research Center as a exp oration. You're completely focused.” 


research scientist. Later, she moved 

to NASA’s Jet Propulsion Lab (JPL), the command center 
for the Mars rover missions. There, her robotics expertise 
and experience with field testing rovers won her a chance 
to drive the Opportunity rover. Verma drove Opportunity for 
3 years before graduating to the nuclear-powered Curiosity 
rover, which is now prowling Mars, examining its rocks to see 
whether it is, or was ever, a suitable habitat for life. 

Each day, before the rover shuts down for the frigid mar- 
tian night, it calls home, Verma says. Besides relaying scien- 
tific data and images it gathered during the day, it sends its 
precise coordinates. They are downloaded into simulation 
software Verma helped write. The software helps drivers plan 
the rover’s route for the next day, simulating tricky maneu- 
vers. Operators may even perform a dry run with a duplicate 
rover on a sandy replica of the planet’s surface in JPL’s Mars 
Yard. Then the full day’s itinerary is beamed to the rover so 
that it can set off purposefully each dawn. 

For the first 3 months after landing, from 5 August to 
5 November 2012, the team was working on Mars time. “I 
loved that we didn’t have to wait long after we uplinked our 


But she has not left research be- 
hind. One of Verma’s key research goals has been to give 
rovers greater autonomy to decide on a course of action. She 
is now working on a software upgrade that will let Curiosity 
be true to its name. It will allow the rover to autonomously 
select interesting rocks, stopping in the middle of a long 
drive to take high-resolution images or analyze a rock with 
its laser, without any prompting from Earth. 

Originally designed for a 2-year mission, Curiosity is still 
going strong and has already made many scientifically signif- 
icant finds. “With every drive, we get to explore new terrain 
that no human has seen in this kind of detail,’ Verma says. 

Although human spacefaring has stalled, Verma says the 
spirit of exploration is alive and well in space robots. “I am 
happy to be working in robotics, pushing the envelope on 
space exploration,’ she says. “We have reached Mars, our 
neighboring planet. We have only just begun.” & 


Vijaysree Venkatraman is a Boston-based science journal- 
ist. For more on life and careers, visit sciencecareers.org. 
Send your story to SciCareerEditor@aaas.org. 
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